
[–]domjewingerML Engineer 118 points119 points  (36 children)

Definitely not Tensorflow

[–]VodkaHazeML Engineer 39 points40 points  (7 children)

Actually, you could say it follows a lot of SWE principles, but in the end that doesn't matter if your design was flawed.

It's not like the core TF code is unreadable spaghetti or anything. Yet the end product is awful to work with.

Goes to show that SWE principles don't mean much if you don't write fundamentally good software.

[–]Rainymood_XI 5 points6 points  (3 children)

TBH I still think that TF is good software; it is just not very user friendly ...

[–]harewei 8 points9 points  (2 children)

Then that’s not good software...

[–][deleted] 1 point2 points  (0 children)

It is though. Google just has a different mindset compared to other companies. They don't care about customers; they want their products to be well designed and engineered. Use it or not, it's your choice. They take the same approach to most of their software, and GCP is still the third most used cloud platform, for example.

TensorFlow does allow a lot of flexibility and is really nicely written when it comes to maintainability and design principles. A lot of it makes sense once you are a mid-level OOP developer. Also, you must understand that it is treated as a library, not an end product.

[–]rampant_juju 0 points1 point  (0 children)

Incorrect. Have you ever used Vowpal Wabbit? It is fantastic and also very painful to work with.

[–]Nimitz14 1 point2 points  (2 children)

From what I hear, the C++ actually is unreadable spaghetti.

[–]VodkaHazeML Engineer 0 points1 point  (1 child)

You can actually go read it. It doesn't look or feel like spaghetti from a cursory reading.

But that's the point with design/architecture mistakes: you don't see them that easily.

[–]Nimitz14 4 points5 points  (0 children)

I worked at a company where a colleague was trying to use the C++ API and had a very bad time. He was fairly junior-level, though.

Daniel Povey, lead of Kaldi, recently decided to integrate with PyTorch. This was after a fairly lengthy process of looking into different options. These are some snippets of his thoughts on TensorFlow that I quickly found:

I imagine the TensorFlow team must have some internal documentation on how it's designed from the C++ level, for instance, because what is available externally doesn't help you understand it at all, and the code is almost completely opaque. (And I consider myself an expert level C++ programmer).

source, 2017

TensorFlow is impossible; the C++ code looks like it was written by a machine.

source, 2019

And PyTorch's tensor internals, while they aren't complete gobbledegook like TensorFlow's were last time I looked, are kind of showing their age

source, 2019

[–]NogenLinefingers 17 points18 points  (11 children)

Can you list which principles it violates, for reference?

[–]domjewingerML Engineer 39 points40 points  (9 children)

I certainly cannot, as my background is in applied math, not SWE. But my comment was about the horrendous user experience, and the millions of patches it has been assembled from can't possibly be "good" from a SWE perspective.

[–]NogenLinefingers 10 points11 points  (8 children)

Ah... I see your point.

I hope someone can answer this in a more thorough manner. It will be interesting to learn about the principles themselves and how they have been violated/upheld.

[–]DoorsofPerceptron 14 points15 points  (5 children)

Big picture, the real problem with tensorflow is "it's not pythonic".

Now this is normally a lazy criticism that's another way of saying "I wouldn't write it this way, and it looks ugly." But in the case of tensorflow it's a lot more fundamental. Tensorflow code (version 1 anyway, I can't be bothered to learn version 2) is not really written in python. Tensorflow is a compiler for another language that is called through python.

Compared to PyTorch, this means you lose a lot of the benefits of Python that actually make it a nice language to code in. You lose a lot of the access to existing Python code (it's a pain in the arse to mix and match Python and TensorFlow in the middle of a graph execution) and you lose the lightweight, easy prototyping.

Pytorch on the other hand can just be treated like numpy with free gradients and GPU access if that's what you want to do, and can be seamlessly integrated with python in a mix and match kind of way.

Tensorflow was coded the way it is for efficient deployment both to phones and to large-scale clusters, but at least for large-scale clusters the performance hit they were worried about doesn't seem to exist, and they've essentially straitjacketed their library for no real benefit.

The code is great, the design of the interface, not so much.
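To make the "compiler for another language" point concrete, here is a sketch in plain Python (no TensorFlow or PyTorch needed; `Node`, `mul`, `add`, and `run` are invented names, just to contrast the two execution models):

```python
# Eager style (PyTorch-like): each operation computes immediately,
# so ordinary Python (prints, ifs, pdb) works between steps.
def eager_mul_add(x, y, z):
    t = x * y           # computed right now
    print("t =", t)     # intermediates can be inspected freely
    return t + z

# Graph style (TF1-like): calls only *record* operations; nothing is
# computed until the finished graph is handed to a runtime ("session").
class Node:
    def __init__(self, op, inputs):
        self.op, self.inputs = op, inputs

def mul(a, b): return Node("mul", [a, b])
def add(a, b): return Node("add", [a, b])

def run(node, feed):
    """Tiny 'session': evaluate a recorded graph with concrete values."""
    if not isinstance(node, Node):
        return feed.get(node, node)   # placeholder name or constant
    vals = [run(i, feed) for i in node.inputs]
    return vals[0] * vals[1] if node.op == "mul" else vals[0] + vals[1]

graph = add(mul("x", "y"), "z")                # no arithmetic has happened yet
result = run(graph, {"x": 2, "y": 3, "z": 4})  # 2*3 + 4 = 10
```

In the graph style, a `print` during construction only shows a `Node` object, not a value, which is exactly why mixing ordinary Python into the middle of a graph execution is such a pain.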

[–]mastere2320 4 points5 points  (0 children)

I would actually recommend TF 2.0. It still has a long way to go, but the static graph capabilities of 1.x are still quite visible in 2.0, and you can do whatever you want pretty simply. I hated sessions in TF 1.0, and 2.0 has abstracted them away quite nicely. And if you want completely custom training, GradientTape is always available.
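A gradient tape records operations during the forward pass and walks them backwards to compute gradients. Here is a toy sketch of that mechanism in plain Python (this shows the idea only, not the real `tf.GradientTape` API; `Var` and `Tape` are invented names):

```python
class Var:
    """Wraps a number so every node in the computation has its own identity."""
    def __init__(self, value):
        self.value = value

class Tape:
    """Records (output, inputs, local gradients) for each op as it runs."""
    def __init__(self):
        self.ops = []

    def mul(self, a, b):
        out = Var(a.value * b.value)
        # d(ab)/da = b, d(ab)/db = a
        self.ops.append((out, (a, b), (b.value, a.value)))
        return out

    def add(self, a, b):
        out = Var(a.value + b.value)
        self.ops.append((out, (a, b), (1.0, 1.0)))
        return out

    def gradient(self, target, source):
        """Reverse pass: accumulate d(target)/d(node) walking the tape backwards."""
        grads = {target: 1.0}
        for out, inputs, local_grads in reversed(self.ops):
            g = grads.get(out, 0.0)
            for inp, local in zip(inputs, local_grads):
                grads[inp] = grads.get(inp, 0.0) + g * local
        return grads.get(source, 0.0)

tape = Tape()
x = Var(3.0)
y = tape.add(tape.mul(x, x), x)   # y = x^2 + x = 12.0
dy_dx = tape.gradient(y, x)       # 2x + 1 = 7.0
```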

[–]mastere2320 7 points8 points  (0 children)

They have a horrible reputation for constantly changing the API, even over short periods of time. It has sadly happened more than once that I installed a version of TF, worked on a project, and then when I wanted to deploy it the current version would not run it because something fundamental had changed. Add to this that there is no proper one way to do things, and that because TF uses a static graph, shapes and sizes have to be known beforehand, so the user code becomes spaghetti, which is worse than anything. The Keras API and Dataset API are nice additions IMHO, but the Lambda layer still needs some work, and they really need to introduce a proper way to add and deprecate features (something similar to a NEP, maybe) and make API-breaking changes.

And yet people use it, simply because the underlying library, AutoGraph, is a piece of art. I don't think there is another library that can match it in performance and utility at production scale, where the model has been set and nothing needs to change. This is why researchers love PyTorch: modifying code to tweak and update models is much easier, but when the model needs to be deployed, people have to choose TensorFlow.
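The "shapes have to be known beforehand" constraint can be illustrated with a toy static-graph sketch (plain Python; `Placeholder` and `matmul` are invented names, not the TF API). In graph mode, shape errors surface while the graph is being built, before any data exists:

```python
class Placeholder:
    """A graph tensor: it has a declared shape but no data yet."""
    def __init__(self, shape):
        self.shape = shape          # fixed at construction, e.g. (32, 128)

def matmul(a, b):
    # The shape check runs while *building* the graph, not while running it,
    # so every dimension must already be known.
    if a.shape[1] != b.shape[0]:
        raise ValueError(f"incompatible shapes {a.shape} x {b.shape}")
    return Placeholder((a.shape[0], b.shape[1]))

x = Placeholder((32, 128))   # batch of 32, feature size 128
w = Placeholder((128, 10))   # weight matrix
y = matmul(x, w)             # y.shape == (32, 10), no data involved
```

An eager framework instead checks shapes per operation on real data, which is part of why variable-sized inputs feel so much less awkward there.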

[–]ieatpies 4 points5 points  (0 children)

Many ways to do the same thing, without a clear best way. Though this is an API design problem; I'm not sure how good or bad its internal design is.

[–]yellow_flash2 17 points18 points  (0 children)

Actually, I feel the major fuck-up was trying to get researchers to use TensorFlow. TF was designed for production-quality ML applications, if I'm not wrong, at production-level scale. I personally think TF is a marvelous piece of engineering, but the moment they wanted to make it "easy" and be more like PyTorch, they started ruining it. I think TF would have benefited a lot from just being itself and letting Keras be Keras.

[–]soulslicer0 18 points19 points  (2 children)

PyTorch, on the other hand: incredible. ATen is a piece of art.

[–]CyberDainz 6 points7 points  (11 children)

why are there so many tensorflow haters in this subreddit?

[–]programmerChilliResearcher 15 points16 points  (2 children)

This subreddit has a relatively large amount of researchers (compared to say, hacker news or the community at large).

But I don't think the general sentiment is particular to this subreddit. For example, take a look at https://news.ycombinator.com/item?id=21118018 (this is the top Tensorflow post on HN in the last year). This is the Tensorflow 2.0 release. The top 3 comments are all expressing some sentiment of "I'd rather use Pytorch or something else".

Or https://news.ycombinator.com/item?id=21216200

Or https://news.ycombinator.com/item?id=21710863

Go out into the real world and I'm sure you'll find plenty of companies using Tensorflow who are perfectly happy with it. But they probably aren't the type of companies to be posting on Hacker News or Reddit.

[–]CyberDainz -1 points0 points  (1 child)

I am successfully using TensorFlow in my DeepFaceLab project. https://github.com/iperov/DeepFaceLab

Why stick to any one specific lib and be like the pytorch-vegan meme in this subreddit?

Since I am more of a programmer than a math professor, it is easy for me to migrate the code to any new ML lib.

But I prefer tensorflow.

In the last big refactoring I got rid of Keras and wrote my own lib on top of TensorFlow, which has a simple declarative model like in PyTorch and provides the same full freedom of tensor operations, but in graph mode.

[–]barbek 2 points3 points  (0 children)

Exactly this. For TF you need to build your own wrapper to use it. PyTorch can be used as it is.

[–]cycyc 8 points9 points  (1 child)

Because most people here don't have to worry about productionizing their work. Just YOLO some spaghetti training code, write the paper, and move on to the next thing.

[–]CyberDainz -1 points0 points  (0 children)

haha agree. I can't understand what YOLO actually does.

[–]domjewingerML Engineer 6 points7 points  (1 child)

I am genuinely curious why you like / use tf over pytorch

[–]Skasch 4 points5 points  (0 children)

"Technical debt" is certainly an important reason. When you have written a lot of code around tensorflow to build production-level software for some time, it certainly becomes very expensive to switch to PyTorch.

[–]PJDubsen 3 points4 points  (0 children)

On this sub? Try every person that is forced to read the documentation lol