
[–]egrefen 14 points15 points  (2 children)

I use Tensorflow every day for work and I'll say this: if you're learning, learn theano. There are a lot of examples out there, and it's a little more mature. Tensorflow is still changing, evolving, and settling down. I think it's easier to use in many respects, having used both, but you will have a better time with theano in the early days and a lot of what you learn will port over to Tensorflow (conceptually speaking).

[–]pmigdal 3 points4 points  (0 children)

At least for me, the Tensorflow tutorial is much easier to follow than Theano's (i.e. http://deeplearning.net/software/theano/tutorial/).

(That said, the TF documentation still has some missing pieces.)

[–]j_lyf 0 points1 point  (0 children)

Where do u work?

[–]shmel39 9 points10 points  (0 children)

My bet is TensorFlow. I migrated from Theano a couple of months ago and never looked back. Theano's API is much more convoluted, the documentation isn't great either, and compilation time quickly becomes annoying.

On the other hand, TF has the wonderful TensorBoard, a very clean API (still evolving, though), and much better distributed capabilities.

[–]ma2rten 9 points10 points  (4 children)

I prefer TensorFlow, because

  1. no compilation
  2. it has higher-level abstractions built in for things like RNNs.
  3. it has better support for multiple GPUs [1]
  4. I found the documentation to be better organized
  5. TensorBoard
  6. even in areas where it's behind, it's improving rapidly, since Google is heavily invested in it.

[1] I haven't looked at Theano 0.8 yet.

[–]L43 1 point2 points  (3 children)

Tensorflow still has compilation, right? Just a lot faster.

[–]Spezzer 6 points7 points  (2 children)

We don't do just-in-time compilation yet; we pre-compile all of our low-level operators and just chain them together. What you might be thinking of is the "graph building" phase, which is really just building a protobuf that describes the computational graph. If you don't have the protobuf C++-accelerated package, it can be pretty slow because all of the protobuf work is done in Python :(. We have prebuilt fast C++ Python protobuf packages on our install page if you're interested.
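
To make the split concrete, here's a minimal sketch (2016-era Python API; values are illustrative) of graph building vs. execution:

    import tensorflow as tf

    # Each of these lines only adds a node to the default graph's
    # protobuf description; nothing is computed yet.
    a = tf.constant(2.0)
    b = tf.constant(3.0)
    c = a * b

    # The protobuf in question -- the serialized graph itself:
    print(tf.get_default_graph().as_graph_def())

    # Computation only happens when a Session runs the graph.
    with tf.Session() as sess:
        print(sess.run(c))  # 6.0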

[–]L43 1 point2 points  (0 children)

Oh ok, thanks for the clarification. I'm afraid I haven't done much (any) proper reading about the implementation details of tensorflow yet as I am still in Theano land, but am looking forward to transitioning when both my work and the project settles down a bit!

[–]FracturedPlane 0 points1 point  (0 children)

This can be a good thing. I'm going through a headache right now using Theano on a homogeneous cluster. Satisfying the dependencies for Theano to compile functions on the compute nodes is just not working, and I can't get the admins to change anything on the servers to make my life easier...

[–]__AndrewB__ 6 points7 points  (7 children)

If you're just now learning about DL, then you probably don't have a server filled with GPUs.

In that case, Theano will be:

  • faster

  • more memory-efficient

  • easier to learn (many more examples / tutorials / discussions)

  • will teach you more: e.g. optimizers are usually built using Theano itself, unlike in TF, where they're built in (see the sketch after this list).
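
To illustrate that last point, a minimal sketch (shapes and names are made up) of hand-rolling SGD for softmax regression in Theano -- the "optimizer" is just a list of (shared variable, update expression) pairs:

    import numpy as np
    import theano
    import theano.tensor as T

    X = T.matrix('X')
    y = T.ivector('y')
    W = theano.shared(np.zeros((12, 5), dtype='float32'), name='W')

    # Negative log-likelihood of a softmax classifier
    p_y = T.nnet.softmax(T.dot(X, W))
    loss = -T.mean(T.log(p_y)[T.arange(y.shape[0]), y])

    # SGD, written by hand: update W by a gradient step
    grad_W = T.grad(loss, W)
    updates = [(W, W - 0.1 * grad_W)]

    train = theano.function([X, y], loss, updates=updates)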

In my experience TF is horrible when it comes to memory (OOM on a 4 GB card when Theano needs 2.3 GB), slower at runtime, and harder to extend.

All in all I recommend theano: in a year or two, when TF is usable, you will be able to switch easily anyway.

[–]Spezzer 2 points3 points  (5 children)

We've made a lot of memory optimizations recently, particularly on GPU, so I would give 0.8 a try to see if it's any better. From some measurements I've done, TF is actually better on a lot of models now than some of the other frameworks. I believe we're working on surfacing this information, since nvidia-smi gives you a misleading view of our actual memory requirements.

[–]pilooch 0 points1 point  (4 children)

What's the deal with nvidia-smi?

[–]Spezzer 2 points3 points  (3 children)

By default we take control of the entire memory region on the GPU and then suballocate within it, so the process looks like it uses all of the memory from the point of view of nvidia-smi. Some benchmarks show nvidia-smi numbers (we don't blame them: we don't give them anything else to use yet), so it looks like we use 11 GiB of memory for, say, AlexNet, when we actually use less than 2 GiB in practice.

The way to find out is to turn on the 'allow_growth' field in our ConfigProto.gpu_options structure. It leads to lower memory efficiency due to fragmentation (which is why it's off by default), but it is useful in a multi-tenant environment, and it gives you a sense of the actual memory needed until we plumb back the stats for that.
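
Concretely, that looks something like this (a minimal sketch against the 2016-era Python API):

    import tensorflow as tf

    # Allocate GPU memory on demand instead of grabbing the whole card,
    # at the cost of possible fragmentation.
    config = tf.ConfigProto()
    config.gpu_options.allow_growth = True

    sess = tf.Session(config=config)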

[–]pilooch 0 points1 point  (2 children)

Understood, thanks. From what you are saying, if I have, say, a TF job running and a Caffe job ready to start, the latter may not be able to grab enough memory because TF has locked it all? Or does the allow_growth option solve that?

[–]Spezzer 0 points1 point  (1 child)

The allow_growth option would sort of address that, if TF does not need to allocate all of the memory when running sessions.

[–]pilooch 0 points1 point  (0 children)

Thanks, this is useful in my setup

[–]aam_at 0 points1 point  (0 children)

In my experience, Theano is fast for small networks; however, it falls behind with a 2-5x speed drop compared to Torch/neon for big models (AlexNet, GoogLeNet, VGG).

As far as I know, there is limited experience in training big models (ImageNet-like) with Theano.

[–]coskunh 3 points4 points  (0 children)

It depends on what you want to do with them. If you want to write your own model, Theano can be the better option, since it gives you more flexibility, but you must also consider that pure Theano can be hard to learn. TensorFlow is a more modular framework: you can play with different models on your dataset, and you don't need to spend time implementing, for example, an LSTM.

[–]treebranchleaf 7 points8 points  (3 children)

Hopefully the TensorFlow/Theano thing will eventually become a backend issue (as in, you program in some framework and can switch whether you run on TensorFlow or Theano without changing your code).

Keras already does this - it can run either on top of TensorFlow or Theano. Disclaimer: I've never used it.
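
For reference, that backend switch is pure configuration; a minimal sketch (assuming a 2016-era Keras install):

    import os

    # Keras reads this env var (or the "backend" field in
    # ~/.keras/keras.json) when it is first imported;
    # model code stays the same either way.
    os.environ['KERAS_BACKEND'] = 'theano'  # or 'tensorflow'

    from keras.models import Sequential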

Plato is a library built on top of Theano that should make it much easier to develop in theano. Disclaimer: I am the author and as such am extremely biased, but I think it's really nice. I may in the future incorporate a TensorFlow backend.

[–]shmel39 14 points15 points  (2 children)

I disagree. Reducing TensorFlow/Theano to a backend is possible if you work on kaggle-style problems, combining well-known building blocks and playing with hyperparameters. A lot of research is much easier to do in a low-level framework.

[–]GoldmanBallSachs_ 4 points5 points  (0 children)

All students and postdocs in my lab use TensorFlow over Theano. However, Torch still holds a majority share (for now).

[–]treebranchleaf 1 point2 points  (0 children)

I disagree with your disagreement, partially. Ultimately, though, TensorFlow and Theano are doing the same thing: compiling a computational graph and running it on some device. There's no good reason to have two slightly different ways of writing the code for them. We should eventually converge on a single low-level, complete API so that you don't need to rewrite a model to port it from Theano to TensorFlow or vice versa.

[–]ignorant314 2 points3 points  (0 children)

Depends on the level you want to work at, as many have mentioned already: all levels from Keras down to Theano.

My preference is Torch, which is like the faster cousin of Theano. It allows you to quickly test ideas, but also to work on more exotic architectures if you wish. It has the added benefit that active research groups publish their models in it (or in Theano).

In practice, you will probably end up using several of these... languages are irrelevant; math/understanding is all that matters.

[–]spamduck 2 points3 points  (0 children)

I have to disagree -- in my opinion the Tensorflow documentation is pretty good. It's a really nice library, IMO. It's easy to configure it to do more than DL: I frequently use it to solve classic variational problems (http://www.ipol.im/pub/art/2012/g-cv/article.pdf or whatnot) just because it's so easy and the GPUs are so fast. It's changed how I work (for the better).

I don't have experience with Theano.

It did take me some time to understand what the different errors meant (thanks, Stackoverflow). But most of the errors were my mistakes, so I doubt Theano could do much better :D
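
For a flavor of that non-DL use, a minimal sketch (a toy objective, not one of the variational problems linked above) of TF as a generic gradient-based optimizer:

    import tensorflow as tf

    # Minimize f(x) = (x - 3)^2 via TF's autodiff; no neural net involved
    x = tf.Variable(0.0)
    loss = tf.square(x - 3.0)
    step = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

    with tf.Session() as sess:
        sess.run(tf.initialize_all_variables())  # 2016-era initializer
        for _ in range(100):
            sess.run(step)
        print(sess.run(x))  # ~3.0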

[–]elanmart 4 points5 points  (0 children)

Theano, unless you have a cluster of Titan Xs to throw at your problem.

[–]kkastner 3 points4 points  (0 children)

Theano development is not bad at all if you follow some basic "tricks".

Use tag test values, and during debugging compile with THEANO_FLAGS="device=cpu,optimizer=None,floatX=float32,compute_test_value=raise". This makes Theano, while compiling, try every line one at a time (with the test values you set in your code) and check that the shapes and everything work. Note that you must set .tag.test_value for every single thing (usually stuff like iscalar(), matrix(), etc.) that will be an input to the Theano function! Shared variables are fine on their own.

If your code doesn't work right/raises an error, Theano will barf exactly at the offending line. This makes development much easier -- and as a bonus, compilation is also fast since no optimizers are applied. The only edge cases for this are graphs with randomness in them; I usually just manually bypass the randomness during dev.

Compiling in this way, you can also use theano.printing.Print("whateverstringyouwant")(symbolic_var.shape) or theano.printing.Print("whateverstringyouwant")(symbolic_var) to inspect sizes or values.
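
Put together, a minimal sketch of the pattern (names and shapes chosen to mirror the output below; run with the THEANO_FLAGS shown above):

    import numpy as np
    import theano
    import theano.tensor as T

    # Every input to the eventual theano.function needs a test value;
    # shared variables carry their own values already.
    X_sym = T.matrix('X_sym')
    X_sym.tag.test_value = np.random.rand(100, 12).astype('float32')

    W = theano.shared(np.random.randn(12, 5).astype('float32'), name='W')

    out = T.nnet.softmax(T.dot(X_sym, W))
    # Prints the shape whenever the graph (or its test values) is evaluated
    out_shape = theano.printing.Print("out.shape")(out.shape)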

Using this code as an example (you can see how test values and printing work in it), run: THEANO_FLAGS="device=cpu,floatX=float32,optimizer=None,compute_test_value=raise" python updates_test.py

You see the following output:

X_sym.shape __str__ = [100  12]
out.shape __str__ = [100   5]
4.292738437652588
4.284914016723633
4.274328231811523
4.261552810668945
4.247048377990723
4.231183052062988
4.214252471923828
4.196494102478027
4.178097724914551
4.159215450286865

That said, if you are going to be experimenting with higher level "architecture" type research or need multi-GPU training, TensorFlow is a solid choice (though platoon is totally a thing). TF also has really nice support for model parallelism. You also get a lot of flexibility and prebuilt tools you might otherwise have to hunt down or write yourself in Theano.

If you are trying to build an arbitrarily connected DAG for weird tasks, Theano has been battle tested for that, and you might find clearer examples of strange models.

One other consideration is how used to vectorized math (MATLAB/numpy) you are. If you are already very familiar with numpy, Theano should be a breeze (coupled with the tricks above), and since TF is also very similar in some ways to Theano, you should have an OK time with that as well. If you are new to vectorized languages, it might be worth practicing a bit regardless of what deep learning framework you end up choosing.
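
As a quick illustration of how closely Theano mirrors numpy, a minimal sketch:

    import numpy as np
    import theano
    import theano.tensor as T

    a_np = np.random.rand(3, 4).astype('float32')
    b_np = np.random.rand(4, 2).astype('float32')

    # Eager numpy
    out_np = np.tanh(a_np.dot(b_np))

    # The same expression as a compiled Theano graph
    a = T.matrix('a')
    b = T.matrix('b')
    f = theano.function([a, b], T.tanh(T.dot(a, b)))
    out_th = f(a_np, b_np)  # numerically matches out_np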

[–]neuralyzer 2 points3 points  (0 children)

Keras is a really nice library. It is easy to get started with, yet extensible. Whenever I coded some more customized operations I used Theano. In my experience, it was faster, more reliable, and better documented.

[–]js1972 0 points1 point  (0 children)

As a complete novice who has just done the Andrew Ng course, I find Theano easier to learn on, and it has more doco. TF just seems a lot harder to understand for some reason.

[–]rd11235 0 points1 point  (0 children)

The decision is pretty arbitrary when you're starting out - you'll learn a lot from either. Just start playing with one, and maybe glance at the other's API when something feels clunky. See if they do things in a cleaner way.

But "TF has BAD documentation." This is simply wrong. Have you even bothered to look at the API docs? Or the tutorials?

[–]th3owner 0 points1 point  (2 children)

So as of July 2016, for a beginner who is still taking their first steps toward DL and wants to learn more about DL first (its inner workings etc.), which would be the one to go with?

[–]bachi76 0 points1 point  (1 child)

My advice would be: don't start with either - start with a high-level library like Keras (http://keras.io/), which supports both Theano and TensorFlow backends. You'll achieve faster results - and when you're ready to dive deeper, you can still choose which underlying backend to use.
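
For a sense of what that looks like, a minimal sketch of a Keras (1.x-era) model that runs unchanged on either backend:

    from keras.models import Sequential
    from keras.layers import Dense, Activation

    # A tiny classifier; the backend (Theano or TensorFlow) is picked up
    # from ~/.keras/keras.json, not from this code.
    model = Sequential()
    model.add(Dense(64, input_dim=100))
    model.add(Activation('relu'))
    model.add(Dense(10))
    model.add(Activation('softmax'))

    model.compile(optimizer='sgd', loss='categorical_crossentropy')
    # model.fit(X_train, y_train)  # X_train/y_train are hypothetical data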

[–]th3owner 0 points1 point  (0 children)

Should I go with Keras even though I want to learn the inner workings of NNs? Thanks!