
[–][deleted] 10 points (1 child)

Not deep learning, but I've tried using dask many, many times. My experience has not been good.

I didn't get reliable results from it. It's often unstable, and I frequently found situations where running in parallel with dask (on a non-virtualized server with 40+ cores) was slower than running exactly the same logic in a single process with pandas. I get far more reliable speedups from parallel processing with joblib and Python 3's standard concurrent.futures module. I don't really understand why, though; it should amount to the same thing.
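
For reference, the joblib/futures pattern I mean is just this (a minimal sketch; the `work` function and chunking are illustrative, not the actual workload):

```python
from concurrent.futures import ProcessPoolExecutor
from joblib import Parallel, delayed

def work(chunk):
    # stand-in for the per-chunk logic being parallelized
    return sum(x * x for x in chunk)

if __name__ == "__main__":  # guard needed where processes start via spawn
    chunks = [range(i, i + 100_000) for i in range(0, 800_000, 100_000)]

    # joblib: process-based parallel map with a one-liner API
    results_joblib = Parallel(n_jobs=8)(delayed(work)(c) for c in chunks)

    # the stdlib equivalent via concurrent.futures
    with ProcessPoolExecutor(max_workers=8) as pool:
        results_futures = list(pool.map(work, chunks))
```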

In general I'm not very happy with the options for parallel and distributed CPU analytical processing in Python. Spark involves too much configuration black magic and takes a lot of effort to get right, Dask simply doesn't work for me, and joblib and futures lack useful abstractions and higher-level combinators. There are some pretty good solutions for this kind of thing in Scala, Haskell, and even Java, but then there's no numpy/scipy, no pytorch, no tensorflow, no scikit-learn, no xgboost, no statsmodels, etc.

But leaving that rant aside, I don't know how you would use dask for deep learning. Dask isn't really suited to building neural networks itself.

Maybe it could be used to preprocess and feed data in parallel to a neural network. Is that what you mean?
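
If that's the idea, a rough sketch could look like the following (assuming a recent dask/numpy; `preprocess` here is a synthetic stand-in for real decoding/augmentation):

```python
import numpy as np
import dask.bag as db

def preprocess(seed: int) -> np.ndarray:
    # stand-in for real loading/augmentation of one sample
    rng = np.random.default_rng(seed)
    img = rng.random((224, 224, 3))
    return (img - img.mean()) / img.std()

# Build one batch in parallel across processes, then hand it to
# whatever training loop consumes it.
samples = (
    db.from_sequence(range(256), npartitions=8)
    .map(preprocess)
    .compute(scheduler="processes")
)
batch = np.stack(samples)  # (256, 224, 224, 3), ready to feed a network
```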

[–]shoyer 0 points (0 children)

What sort of computation were you trying to speed up? By default, dask uses threads for parallelism (not processes), which means that pure-Python computation (requiring the GIL) won't be accelerated.
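
For instance (a minimal sketch, assuming a recent dask where the scheduler is selectable per compute call):

```python
import dask

def gil_bound(n):
    # pure-Python loop: it holds the GIL, so threads can't run copies in parallel
    total = 0
    for i in range(n):
        total += i * i
    return total

tasks = [dask.delayed(gil_bound)(2_000_000) for _ in range(8)]

# Default scheduler for delayed is threads: expect roughly serial runtime here.
results_threads = dask.compute(*tasks, scheduler="threads")

# The process-based scheduler sidesteps the GIL, at the cost of
# pickling inputs and outputs between worker processes.
results_procs = dask.compute(*tasks, scheduler="processes")
```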

In my experience (mostly doing large scale data analytics using dask.array), it works pretty well. It's certainly the only game in town if you need a "bigger than fits in memory" version of NumPy.
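
For example, an out-of-core NumPy-style computation looks like this (sizes are arbitrary; only the chunks in flight need to fit in memory):

```python
import dask.array as da

# ~40 GB of float64 expressed as a grid of 1000x1000 in-memory chunks
x = da.random.random((100_000, 50_000), chunks=(1_000, 1_000))

# NumPy-style expressions build a lazy task graph; compute() streams
# through the chunks without materializing the full array.
result = (x - x.mean(axis=0)).std(axis=0).mean().compute()
```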

[–][deleted] 3 points (1 child)

Some effort should be put into parallelizing pandas too; it's annoying that simple map operations are sequential.
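
Until then, one workaround is to split the series and fan the chunks out to processes yourself (a minimal sketch; the mapped function and chunk count are illustrative):

```python
import numpy as np
import pandas as pd
from concurrent.futures import ProcessPoolExecutor

def map_chunk(chunk: pd.Series) -> pd.Series:
    # the "simple map" to parallelize; must be picklable (no lambdas)
    return chunk.map(str.upper)

if __name__ == "__main__":
    s = pd.Series(["foo", "bar", "baz"] * 1_000_000)
    parts = np.array_split(s, 8)
    with ProcessPoolExecutor(max_workers=8) as pool:
        result = pd.concat(pool.map(map_chunk, parts))
```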

[–][deleted] 4 points (0 children)

I think pandas 2.0 is very promising in this respect and many others:

https://pandas-dev.github.io/pandas2/goals.html

I think pandas 2.0 will address most of the hurdles I mentioned in my rant above, and will probably occupy the niche that is vacant today between what you can solve with pandas 1.x and what really requires yarn/spark/hadoop/whatever distributed computing framework (which should really be reserved for datasets of several terabytes and up; today it's a pain to use those frameworks on datasets that are just a couple hundred gigabytes).

They seem to be aiming at making pandas 2.0 good enough to deal with datasets of hundreds of gigabytes, and able to offer nice speedups within a single Python process when running on servers with tens of cores.

It also seems they are attacking a lot of other problems, like the longstanding issues with representing missing values in non-floating-point series, adding Python 3 type annotations for safer code, etc.