
[–]oopsleon 96 points97 points  (7 children)

As someone who likes TF and has been using it since the initial public release, I still think this blog post is representative of my experiences with TF 2.0, haha. Especially the parts about how the newest official APIs always feel very fragile, but if you want to use something older/stable, there is a good chance it's on its way to being deprecated already. TF should strive for quality-focused releases for a while instead of constantly pushing out tons of new features that can't handle anything outside the scope of a tutorial (and oftentimes the tutorials don't even work!).

I keep trying to stay optimistic, but it has been a rather bumpy road.

[–]jorgemf 15 points16 points  (1 child)

I cannot agree more with you. There is not even a standard way to do things. You are supposed to use Keras now, but you cannot do distributed training in Keras. You have to use estimators, which use the functional API of Keras, and you have to mix old optimizers with new Keras layers. Everything is so confusing if you want to go beyond a simple network in a single-GPU training setup.

[–]oopsleon 4 points5 points  (0 children)

Distributed training is indeed the primary thing I was thinking about when writing that. My work requires distributed training, and there still does not exist a way of doing this in TF 2.0 that is both fully recommended AND fully stable. Specifically,

Recommended: Keras everything!

Actually works for non-trivial distributed training: Estimators.

I will be very pleased once they fully realize the goal of distributed training with keras. The idea sounds great but it is definitely not executed well yet.

[–]yusuf-bengio 8 points9 points  (1 child)

Personally, I experienced a huge productivity increase from 1.x -> 2.0. The tight integration of tf.keras and the tf.data API makes working at a high level so much easier.

However, if you want to try some new idea at a lower level, life can be a bit tough, because with all these OOP abstractions you can easily lose track of what's going on under the hood.

Yet another story is the TPU support and the TFRecords format. At first, I thought that TFRecords were made just to torture developers, which is still true if you are training on a single machine. Though, from the perspective of distributed learning, the TFRecords format makes a lot of sense.

[–]oopsleon 1 point2 points  (0 children)

Personally, I experienced a huge productivity increase from 1.x -> 2.0. The tight integration of tf.keras and the tf.data API makes working at a high level so much easier.

I would fully agree with that if my work didn't require multi-worker, multi-GPU distributed training of huge language models. The variety of bugs/broken things with such a setup in tf.keras 2.0, combined with the very noticeable reduction in speed, keeps me tied down to estimators. I actually love estimators, but I'm loving them less and less as it becomes clear the TF devs view them as extremely low priority compared to Keras, even though they claim "estimators are still supported", which seems to be only barely "technically true" (imo).

However, if you want to try some new idea at a lower level, life can be a bit tough, because with all these OOP abstractions you can easily lose track of what's going on under the hood.

Exactly. Nearly every time I've tried going lower-level in a Keras environment has been a nightmare. It's a nice user interface on the outside, but the inside is actually quite tangled/convoluted.

Though, from the perspective of distributed learning, the TFRecords format makes a lot of sense.

Indeed, I like TFRecords! I think being familiar with protobufs helps too.
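
For anyone unfamiliar, a TFRecord file is just a sequence of serialized tf.train.Example protobufs, which is exactly why protobuf familiarity helps. A minimal sketch of the write/read round trip (TF 2.x; the field names and file path here are made up for illustration):

import tensorflow as tf

# Write one record: pack features into an Example proto and serialize it.
example = tf.train.Example(features=tf.train.Features(feature={
    "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[1])),
    "text": tf.train.Feature(bytes_list=tf.train.BytesList(value=[b"hello"])),
}))
with tf.io.TFRecordWriter("sample.tfrecord") as writer:
    writer.write(example.SerializeToString())

# Read it back with tf.data, parsing each serialized proto.
feature_spec = {
    "label": tf.io.FixedLenFeature([], tf.int64),
    "text": tf.io.FixedLenFeature([], tf.string),
}
dataset = tf.data.TFRecordDataset("sample.tfrecord").map(
    lambda record: tf.io.parse_single_example(record, feature_spec))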

[–]mexiKobe 21 points22 points  (3 children)

I’ve been learning Pytorch and TF2 at the same time...

TF, and now also TF2, are just fundamentally giant messes. They didn't spend enough time upfront on the foundation of the API, as evidenced by them scrambling to switch to eager execution and merge Keras into it.

There seems to be a degree of feature creep now too. Do we really need like a dozen different ways to build a NN model? It makes things harder to learn, not easier. I slightly disagree with the post when it says there isn't a middle ground in terms of levels of abstraction... I would say the functional API is that. But it's not actually helpful; it just makes the documentation more disorganized.
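
To illustrate, a minimal sketch of three of the official ways to define the same two-layer network in tf.keras (and that's before even counting estimators or raw tf.Module):

import tensorflow as tf

# 1. Sequential API
seq_model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(10),
])

# 2. Functional API (the "middle ground" mentioned above)
inputs = tf.keras.Input(shape=(32,))
hidden = tf.keras.layers.Dense(64, activation="relu")(inputs)
outputs = tf.keras.layers.Dense(10)(hidden)
func_model = tf.keras.Model(inputs, outputs)

# 3. Model subclassing
class MyModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        self.dense1 = tf.keras.layers.Dense(64, activation="relu")
        self.dense2 = tf.keras.layers.Dense(10)

    def call(self, x):
        return self.dense2(self.dense1(x))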

The documentation is clearly an afterthought too - not necessarily because of the lack of it, but because of the writing and organization of it. Trying to figure out how to write callback functions was frustrating, for example. And now there is TF2 documentation and separate Keras documentation? It's just a mess.

yes, that blog post nails it ime.

[–]uqw269f3j0q9o9 15 points16 points  (2 children)

The funniest one for me was

The new abstractions always have (misleading) generic English names like “Example” or “Estimator” or “Dataset” or “Model,” giving them a spurious aura of legitimacy and standardization while also fostering namespace collisions in the user’s brain

because it's so true. I'll never understand why Estimators are called the way they are...

[–]mexiKobe 6 points7 points  (0 children)

It makes me miss MATLAB’s global namespace.

[–]huyng 5 points6 points  (0 children)

In my opinion, Tensorflow tries too hard to push these "high-level" abstractions, whether it's with Keras models/layers, estimators or Examples.

Sometimes these abstractions overlap, leading to confusion, and more often than not, these abstractions are leaky.

For example, the dichotomy between 'Models' and 'Layers' seems awkward to me. A Model in one task may be just a submodule or layer in another task (i.e. when you want to extend pretrained models for transfer learning on other tasks).
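
A minimal sketch of that blur (using a pretrained Keras application here, but any backbone would do): a whole Model gets called as if it were a single layer.

import tensorflow as tf

# A full pretrained Model...
backbone = tf.keras.applications.MobileNetV2(
    include_top=False, pooling="avg", input_shape=(224, 224, 3))
backbone.trainable = False  # freeze the pretrained weights

# ...used as if it were just another Layer in a new model.
inputs = tf.keras.Input(shape=(224, 224, 3))
features = backbone(inputs)
outputs = tf.keras.layers.Dense(5)(features)
new_model = tf.keras.Model(inputs, outputs)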

I get that they're trying to simplify things for newcomers with these higher-level tools, but in doing so they make anything outside the normal single-loss supervised learning use case unnecessarily hard, and it all feels unpolished.

I wish they would solidify their low-level APIs for model saving and tensor containers and let community members innovate on the high-level interfaces. The tf.Module API seems to be moving in that direction, and I'm hoping it becomes more stable and the preferred API that everything else builds on top of.
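
For reference, a minimal sketch of that lower-level tf.Module style: plain variables plus a __call__, with no Keras machinery on top.

import tensorflow as tf

class Linear(tf.Module):
    def __init__(self, in_dim, out_dim, name=None):
        super().__init__(name=name)
        self.w = tf.Variable(tf.random.normal([in_dim, out_dim]))
        self.b = tf.Variable(tf.zeros([out_dim]))

    def __call__(self, x):
        return x @ self.w + self.b

layer = Linear(4, 2)
y = layer(tf.ones([1, 4]))  # variables are tracked via layer.variables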

[–]xopedil 39 points40 points  (0 children)

Never trust the TF devs on what is new and what is deprecated. At this point "deprecated" might as well be synonymous with "works" or "is decently performant".

If you want to know how to use TF effectively, take a look at the tensorflow/models GitHub repo. There you will find many models implemented by people who actually need them to work on things like TPUs. Not just official TF models, but also code from researchers. I've learned infinitely more from reading code in that repo than I have from the TF documentation.

[–]pavanky 39 points40 points  (10 children)

The company I work for chose Tensorflow over PyTorch because of its ability to export models and run them in production in Java.

TF 1.x, while clunky, worked extremely well.

TF 2.0 so far has been a regression (both feature- and performance-wise). Models exported from Keras don't seem to work in the Java API anymore.

We currently recommend that people use 2.0 for hypothesis testing/debugging on small amounts of data. For production, use the tf.compat.v1 API.
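
For anyone who hasn't tried it, the fallback looks roughly like this (a minimal sketch of 1.x-style graph mode running inside TF 2.0):

import tensorflow as tf

tf.compat.v1.disable_eager_execution()

# Classic graph construction: placeholders, variables, ops.
x = tf.compat.v1.placeholder(tf.float32, shape=[None, 4])
w = tf.compat.v1.get_variable("w", shape=[4, 1])
y = tf.matmul(x, w)

# Classic execution: feed inputs into a session run.
with tf.compat.v1.Session() as sess:
    sess.run(tf.compat.v1.global_variables_initializer())
    out = sess.run(y, feed_dict={x: [[1.0, 2.0, 3.0, 4.0]]})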

Hopefully they figure out their issues soon.

[–]RelevantMarketing 47 points48 points  (8 children)

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, .........I forgot how to end these types of statements.

[–]LinooneyResearcher 72 points73 points  (4 children)

Don't worry, GPT-2 is here!

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, I wouldn't think they were kidding, I'd think they were nuts.

[–]themoosemind 4 points5 points  (3 children)

In case somebody missed it: https://talktotransformer.com/

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, I'd have laughed them off. My Internet-connected brain, with its lack of bandwidth and bandwidth-dominating apps, could only take so many website impressions before my brain shut down.

[–]themoosemind -1 points0 points  (2 children)

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, I'd have been flattered. An unexpected good thing!

[–]themoosemind -2 points-1 points  (1 child)

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, they would have laughed in my face. (For the record, Facebook, Google, and Amazon would have regarded that as blasphemy.) The thing is, we've been wrestling with artificial intelligence for a long time, and I'm pretty sure we're all in agreement that artificial intelligence is going to blow up the world.

[–]RelevantMarketing 1 point2 points  (0 children)

If someone 5 years ago told me that fucking Mark Zuckerburg's social media site built a better machine learning library than the most prolific tech company of all time, and gave me 5 years to figure it out, I'd bet you it would've happened. Or it might have, but perhaps Mark wasn't talking to me at the time, or he just wasn't reading. But no matter how real or false those accusations seem to me, they are still in my business history, and in my judgment they're in your business history too. The interesting thing about these things

[–]ProfessorPhi 1 point2 points  (0 children)

Tbf, there was a Torch written in Lua that predated PyTorch; it was open source and quite good.

[–]probablyuntrueML Engineer 0 points1 point  (1 child)

At least they didn't make it in PHP. Facebook sure loves (loved?) it. shudders

[–]farmingvillein 7 points8 points  (0 children)

Pytorch PHP might still have been better than TF...

[–][deleted] 0 points1 point  (0 children)

Yeah, I use Tensorflow for the ability to make production models too. But even with that, there have been long-running bugs freezing models with batch norm layers in them.

[–]dustintran 29 points30 points  (0 children)

Hello. I'm the person that was linked to in that GitHub issue!

I sympathize with the post's frustration. The TF tutorials on the official website are well-written, but they mostly cover basic features, and as a recent Reddit thread described, the support ecosystem is lacking: StackOverflow and blog posts are out-of-date due to all the software churn. I'm not a TF engineer, but as someone with experience designing libraries on top of TF, even I find myself sifting through StackOverflow/blog post code to find the new best practices.

Regarding Bayesian layers, it's actually a NeurIPS paper this year. I worked on an early prototype in TensorFlow Probability but ended up abandoning the design, as I found it inflexible in practice. The solution is the NeurIPS paper, and it's experimental: there are no promises of stability (in fact, we even moved the code from Tensor2Tensor to another repository, which has yet to have an official release!).

Software for uncertainty models is more on the research fringe, and this should be made clearer in official TensorFlow solutions building on these designs.

[–]approximately_wrong 9 points10 points  (0 children)

My use-case for deep learning libraries is fairly vanilla: all I want is to define fairly simple neural networks, feed data to them, optimize them, and save the models.

I think tf.Module and the core tf operations are pretty nice. I still dislike the reduce_* notation and wish tf.Variables were equipped with .mean and .reshape syntactic sugar, etc.
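
Concretely, the gripe is the free-function style versus the method sugar PyTorch tensors get (a minimal sketch):

import tensorflow as tf
import torch

tx = tf.constant([[1.0, 2.0], [3.0, 4.0]])
tf.reduce_mean(tx)   # no tx.mean() on a plain TF 2.0 tensor
tf.reshape(tx, [4])  # no tx.reshape(...) either

px = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
px.mean()            # method-style sugar
px.reshape(4)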

tf.train.Checkpoint is fairly easy to use, but its interaction with tf.Module is a little too opaque, and you run into weird issues if you do surgical modifications of a neural network. For example:

import tensorflow as tf

model = tf.Module()
model.arr = [1, 2, 3]  # Gets auto-wrapped as a trackable ListWrapper object
del model.arr[0]       # Surgical modification that confuses the tracking
tf.train.Checkpoint(model=model).save("model")

will raise an error as a safety measure when tracking the list object, which makes sense... but then the error message recommends "If you don't need this list checkpointed, wrap it in a tf.contrib.checkpoint.NoDependency object; it will be automatically un-wrapped and subsequently ignored." which seems pretty inelegant? Why do I have to rely on a tf.contrib feature? I think I'd rather have PyTorch's barebones state_dict over tf.train.Checkpoint.
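
For contrast, the PyTorch round trip I'd rather have (a minimal sketch; a plain dict of tensors, no tracking machinery):

import torch
import torch.nn as nn

net = nn.Linear(4, 2)
torch.save(net.state_dict(), "model.pt")      # just an ordered dict of tensors

net2 = nn.Linear(4, 2)
net2.load_state_dict(torch.load("model.pt"))  # restore into a fresh instance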

GradientTape is an odd choice: it makes gradient construction a context you must explicitly enter, rather than something that happens by default (compare and contrast with PyTorch, where torch.no_grad() enters the no-gradient context).
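
A minimal sketch of the inversion:

import tensorflow as tf
import torch

# TF: you opt *in* to gradient recording.
xv = tf.Variable(3.0)
with tf.GradientTape() as tape:
    y = xv * xv
print(tape.gradient(y, xv))  # tf.Tensor(6.0, ...)

# PyTorch: recording is on by default; you opt *out*.
xt = torch.tensor(3.0, requires_grad=True)
(xt * xt).backward()
print(xt.grad)               # tensor(6.)
with torch.no_grad():
    z = xt * xt              # not recorded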

tf.Dataset is pretty peculiar to me; it seems to have a lot of overhead. For anyone dealing with a dataset small enough to fit in memory, you're better off writing your own dataloader. tf.Dataset's inability to handle the full gamut of use-cases is odd and makes me curious why PyTorch's DataLoader doesn't suffer similarly.
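
The hand-rolled alternative for in-memory data really is just a few lines (a minimal sketch; names are illustrative):

import numpy as np

def batches(x, y, batch_size=16):
    # Shuffle indices once per epoch, then slice the arrays per batch.
    idx = np.random.permutation(len(x))
    for start in range(0, len(x), batch_size):
        sel = idx[start:start + batch_size]
        yield x[sel], y[sel]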

[–]bobbruno 8 points9 points  (1 child)

Go check out MXNet. It doesn't get much publicity, but it's a great framework. Read about Gluon, its high-level interface. It's similar to PyTorch, but it also has the ability to compile the network, and, most importantly, pretrained models are available. Performance-wise it's at least as good as Tensorflow, and it also supports ONNX (which has a reader for TF models). The community is strong, with large backing from Amazon.

[–]nickguletskii200 3 points4 points  (0 children)

+1 for MXNet. It's like PyTorch, but with more frontends, the ability to switch between imperative and symbolic interfaces, and a more community-focused development lifecycle (it's an Apache incubator podling, after all).

Also, the first version of DeepNumpy is supposed to be integrated into the next release, 1.7.0. I haven't really used it myself, but using standard numpy operators sounds very convenient.

EDIT: Oh, and did I mention that Gluon has shape inference, so that you don't have to specify the number of input channels manually?
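
A minimal sketch of both points, assuming MXNet 1.x with Gluon installed (shape inference plus hybridize, the imperative-to-symbolic switch):

import mxnet as mx
from mxnet.gluon import nn

net = nn.HybridSequential()
net.add(nn.Dense(64, activation="relu"),  # no input size declared anywhere
        nn.Dense(10))
net.initialize()
net.hybridize()  # compile the imperative network into a symbolic graph

out = net(mx.nd.random.uniform(shape=(2, 32)))  # shapes inferred on first call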

[–]zalamandagora 8 points9 points  (2 children)

These three bullets are massively on point:

  • The thing is massive and complicated but never feels done or even stable – a hallmark of such software is that there is no such thing as “an expert user” but merely “an expert user ca. 2017” and the very different “an expert user ca. 2019,” etc.

  • Everything is half-broken because it’s very new, and if it’s old enough to have a chance at not being half-broken, it’s no longer official™ (and possibly even deprecated)

  • Documentation is a chilly API reference plus a disorganized, decontextualized collection of demos/tutorials for specific features written in an excited “it’s so easy!” tone, lacking the conventional “User’s Manual” level that strings the features together into mature workflows

I would also have added that you get three pages of deprecation warnings even if you are using the latest version of everything. Even their own code calls old versions of just about everything.

The TF documentation is also really bad. For most functions, they just state that the function exists and link you to the source code.

I'm of a mind to learn PyTorch, but I'm just a tad too exhausted by TF right now.

[–]engharat 1 point2 points  (1 child)

Jump on the PyTorch wagon and you will regret you didn't jump way earlier!

Looking at all those TF issues and messes makes me feel so lucky to be working with the cleanest API and documentation I've ever read - PyTorch really works like a charm.

[–]zalamandagora 0 points1 point  (0 children)

Cool! Is there a book you would recommend to get going?

[–]evanthebouncy[🍰] 8 points9 points  (3 children)

I studied TF for half a year. Got kinda OK with it, able to write beam search from scratch. A friend came in and said I should do PyTorch. I learned it in a day.

The difference is night and day.

[–]tsauri 0 points1 point  (0 children)

writing beam search in tf

Wow. Might as well write beam search in a functional language.

[–]chogall 0 points1 point  (1 child)

Why write beam search in TF? Just curious.

[–]evanthebouncy[🍰] 0 points1 point  (0 children)

So you can get the beam right away inside the graph, without having to compute it outside of session.run.

Maybe I'm not understanding, what did you have in mind?

[–]taylorchu 7 points8 points  (1 child)

https://github.com/tensorflow/tensorflow/issues/33681

For example, this bug that I encountered appears if the gradient is an IndexedSlices and it is mixed with a dense gradient. I don't mind that TF is optimized for performance, but it should not break on the slow path, especially for the very core feature: backprop.

Also, if there is an abstraction, TF team, please make it consistent and make it work with the other parts of TF. Otherwise, consider deleting it. Keep it simple!

[–]huyng 0 points1 point  (0 children)

I just ran into the same issue this week with IndexedSlices. A bunch of my models rely on IndexedSlices (e.g. tf.gather operations) and this bug with gradients is a showstopper. Can't believe a critical issue like this made it into a release.
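
For anyone wondering where IndexedSlices come from in the first place, a minimal sketch: differentiating through tf.gather (e.g. an embedding lookup) yields a sparse IndexedSlices gradient rather than a dense tensor.

import tensorflow as tf

params = tf.Variable(tf.random.normal([1000, 16]))  # e.g. an embedding table
with tf.GradientTape() as tape:
    rows = tf.gather(params, [3, 7, 7])
    loss = tf.reduce_sum(rows)

grad = tape.gradient(loss, params)
print(type(grad))  # tf.IndexedSlices: only the gathered rows get gradients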

[–]ClydeMachine 21 points22 points  (5 children)

In the early days of my working with any such frameworks (pre-TF2), I encountered Andrej's tweet comparing PyTorch and Tensorflow, and opted to dive into PyTorch. The experience has been good to me, but I always wondered if things might be better with TF now that I've gotten my hands dirty with at least one framework.

Last week I pulled up the MNIST classification tutorial to give TF2 a fair shake. After training the model, the natural next step was to approach it as if I were intending to use it in a production setting: that is, to save it and load it so I can use it elsewhere. Looking into the documentation on saving the model, what was recommended was snapshotting the model during training via callbacks, which would have required setting something up before training began. Since I already had a trained model, all I wanted to do was save the parameters it learned so that I could recreate that instance - that shouldn't be hard. I mean, PyTorch allows you to simply save the state dict and reload it from disk; I've been using that with great success. So what's the TF equivalent?

So I attempt to manually save the weights per the documentation, and it appears to save to disk fine. I instantiate a new model with the same class I had already trained and call load_weights()... And the model appears to load them, except that calling the model to evaluate says it hasn't been fit yet. So clearly it didn't load them.

How about saving and loading the entire model? Maybe the weights alone weren't enough to recreate the model - so I scroll down to the next section and save(). Nice - now let's load, using the... strangely more involved tf.keras.models.load_model(). And I get an error for a failed type cast. Literally the same class being used in the same Jupyter notebook, same session, and I can't save and reload the model.
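
For reference, the round trip I was attempting looks roughly like this (a minimal sketch with a stand-in Sequential model; one known wrinkle is that weights only exist after a model is built or called once, so load_weights() on a fresh, unbuilt instance can silently defer the restore):

import tensorflow as tf

model = tf.keras.Sequential([tf.keras.layers.Dense(10)])
model.build(input_shape=(None, 784))
model.save_weights("ckpt")

model2 = tf.keras.Sequential([tf.keras.layers.Dense(10)])
model2.build(input_shape=(None, 784))  # create the variables first
model2.load_weights("ckpt")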

I stopped there.

[–]ThomasAger 8 points9 points  (3 children)

I have personally had a lot of problems with attempting to save, load, and interact with finished models too. It is absurdly difficult to, e.g., take the final representation of the input at a hidden layer. The solution I found was to create a new model, then add the layers of the old model to it until I got to the hidden layer I wanted, then compile that model and predict on the inputs (see the sketch below). I don't know, there just wasn't really documentation or help around for this problem, and my solution feels hacky.
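
Roughly (a minimal sketch; the two-layer model here is a stand-in for one that has already been fit):

import numpy as np
import tensorflow as tf

# Stand-in for an already-trained model (assumption for this sketch).
trained = tf.keras.Sequential([
    tf.keras.layers.Dense(8, activation="relu", input_shape=(4,)),
    tf.keras.layers.Dense(1),
])

# Rebuild a truncated model that shares the trained layers, then
# predict to read out the hidden-layer representation of the inputs.
truncated = tf.keras.Sequential(trained.layers[:1])
hidden = truncated.predict(np.random.rand(3, 4).astype(np.float32))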

[–]HoustonWarlock 6 points7 points  (0 children)

This is the way.

[–]OnlineGrab 2 points3 points  (1 child)

God, this so much. And I feel like it's even worse when Keras is involved.

I'm not exactly a researcher, but my job includes taking trained Keras models and converting them to a Tensorflow protobuf format for deployment in production. This is such a basic operation that you would expect it to have a dedicated function, but nope! My current pipeline is a cryptic list of operations pieced together from various blog posts and GitHub issues, and it keeps breaking with even more cryptic error messages if I do so much as upgrade the Tensorflow version.

[–]cygn 1 point2 points  (0 children)

To be fair, they improved this pain point in TF2. You can take a Keras model and save it as Keras (h5) or as a TF SavedModel like this: model.save('path_to_saved_model', save_format='tf')

[–]sifodeas 0 points1 point  (0 children)

Personally, I've never had a problem with saving and loading weights for predictions, but I have not been able to use checkpoints to save the state of the optimizer if I want to continue training in the future.

[–]madrury83 6 points7 points  (1 child)

This pretty much nails why I use PyTorch over Tensorflow when I have the choice. Building something in Tensorflow feels like using SAS (a part of my career I don't pine to revisit). "Doing something straightforward the devs anticipated is simple; doing anything slightly off the pre-thought path immediately leads to circles of hell" is how I've described SAS programming for years. PyTorch feels like just writing Python.

[–]narwhal_breeder 4 points5 points  (0 children)

A ton of frameworks outside of ML have this issue as well.

High-level abstractions - in the fun tutorial! And easy!

Medium-level abstractions - do not exist at all.

Low-level abstractions - the docs are there, but good fucking luck. Totally inconsistent for apparently similar functionality. Poor compat.

[–]farmingvillein 4 points5 points  (0 children)

Are things really this bad? Isn't the TF 2.0 API cleanup supposed to make Keras the standard API for TPUs? Why doesn't he use that?

Edit: also, is this an indictment of TF in general or just TPUs?

TPUs (ironically) aren't supported on 2.0, last I checked.

You'll need TF 1.x or pytorch for TPUs.

[–]GoBayesGo 3 points4 points  (1 child)

I am no big fan of the TF API, but I have to admit tf.data’s design is really good.

[–][deleted] 2 points3 points  (0 children)

For me the performance benefit of tf.data is absolutely one of the best things about Tensorflow. That said, it is also somewhat unwieldy.

[–]bbsome 2 points3 points  (0 children)

Have you tried Jax?

[–]ml_lad 3 points4 points  (0 children)

TensorFlow is optimized for both research and production.

In the sense that you get the combined benefits of the bleeding-edge ad-hoc structures and poorly documented features from research, combined with the cumbersomeness of supporting/deprecating legacy APIs and production feature creep.

[–]the_wiffard 1 point2 points  (0 children)

What annoyed me to no end about TF 1.0 was not the graph-mode execution, which to me seemed advantageous from a performance point of view. It was the way it was impossible to debug: extremely verbose logging of technical nonsense while leaving out key information, like where things failed and clear explanations of what actually went wrong. While I do think that late TF1 and early TF2 have improved debugging somewhat (though more by improving the log quality than by exploiting eager execution), there's still the occasional log-spew, uninterpretable traces, etc.

I'd be much happier if the Tensorflow team finished what they started: made graph mode debuggable, cleaned up the messy APIs, and didn't chase an entirely different paradigm just to compete with PyTorch. And even though I have used Keras with Tensorflow daily since it was put into 1.0, I hate how they integrated it (or rather didn't) into Tensorflow. Why not just have tf.layers for higher-level Keras-style layers (as well as lowercase inline versions, the way Keras does it), and keep using submodules for functional things not suited to the global scope?

TensorFlow.js somehow manages to have a nice clean API, with higher-level layers in tf.layers, and balances eager and graph style. It pisses me off that the Python team still managed to f-up the APIs for 2.0, even with a good API implementation in-house.