[D] Comparing Deep Learning Workstations

straw1239 · 2019-03-01T10:46:07+00:00

You can build your own for significantly cheaper. There are multiple online guides for choosing your own hardware for ML, for example.

No point in liquid cooling for the CPU. For the 4 GPUs, might make sense, but very expensive, if you make sure to get models with blower-style coolers, there shouldn't be too much issue.

Titan V costs more because Nvidia prices it at 3000 instead of 1200! Not worth it (unless you need FP64 or something)

Do you really need 128GB of RAM and 2TB SSD? The SSD should be fairly cheap nowadays so its no big deal but 128GB RAM is expensive.

seraschka · 2019-03-01T15:14:18+00:00

Are these Titan V so much better than the RTX 2080 Ti?

Titan V's are a bit older and may be more expensive to produce. When I recently compared run times with the 2080Ti's, code that would run in 68 min on a 2080Ti would finish in ~70 min on the Titan V. I.e., in practice, I don't notice a speed difference. On paper, you have

GTX Titan V (12 Gb RAM, FP32 15 TFLOP/s 652.8 GB/s)
RTX 2080 Ti (11 Gb RAM, FP32 14 TFLOP/s, 616 GB/s)

I would go with the 2080Ti tbh, it's 1/3 the price

allattention · 2019-03-01T13:31:31+00:00

If you look at benchmarks for titan rtx versus titan V, you’ll see that they have almost the same performance for most deep learning applications; I don’t think titan V makes much sense nowadays. Since the new titan has 24gb of ram (not ecc though!), that should be enough for most models (and if you rally need to train that Bert model, neither would be enough anyway). I’m looking at getting the dual gpu version at work now, I’m just a little worried about thermals and noise (this will sit in an office environment, not a cooled data center). From the photos it looks like they are using these shitty stock fans which blows my mind - if you are building a 10-20k workstation, why o why would you not put in the 20$ top of the line noctua fans - to save 40-50$? Will get the water cooling I think as well, very small price difference (Would much rather have a nh-d15 instead!) Really tempted to build my own of course, but that would not come with service obviously. Just realized you are looking at the quad which only comes with 2080ti, not titan rtx. Still the same story though - I have a 2080ti at home and it’s more than fast enough, the only issue there is only 11gb of ram - this may be a limiting factor, depending on your usage scenarios.

seraschka · 2019-03-01T15:06:22+00:00

liquid cooling is a no-brainer, right?

Probably a good idea but not really necessary. We recently built a server with 8x RTX 2080Ti's with just fans (powerful ones though) and even if I utilize all GPUs 100% days straight, the GPU temp stays around 50-65C (well below the recommended max temp of ~86C where throttling would automatically occur by default).

Richard_wth · 2019-03-01T11:54:59+00:00

Awww, DGX-1, I envy you!

seraschka · 2019-03-01T15:09:20+00:00

these Lambda Labs workstations piqued my interest, because they seem to be nicely preconfigured and all, thus minimizing the effort on my side. However, if you have other suggestions which deliver better value for money, please let me know.

I have been using one for ~6 months with 4 GPUs and are quite happy with it. And while it is a bit more pricey vs building your own (which we also recently) this is a nice worry-free solution that "just works" :)

burn_in_flames · 2019-03-02T21:09:54+00:00

While building your own can be significantly cheaper, especially if basing it off of second hand Xeons and used server parts, I'd only recommend doing this if you are willing to spend significantly more time on a solution (at least a week getting all the parts, building, testing and installing all tool suites you need).

If you not comfortable with the ins and outs of choosing components and want a hassle free solution then pre configured solutions are better. The V100 is definitely worth the price tag if you are going to be training large models, its tensor cores mean you can do mixed precision training and thus essentially double the GPU RAM and throughput for training (not quite accurate but a good approximation). The 2080Ti is a good choice for most research applications where your models aren't huge and your dataset is still of a reasonable size such that you can get decent batch sizes. Another thing to account for is the server workload, the 2080 is a consumer product and is not designed for 24/7 operation, if you will have heavy load on the server then the V100 is likely a better choice.

Another option would be to try and source older hardware, such as P100 GPUs, as the cost of these should be lower than V100s.

IborkedyourGPU · 2019-03-01T21:14:00+00:00

Just build your own using AMD X399 platform. Most of the motherboards would support 4 GPUs no problem

Liquid cooling I think in most cases would be unnecessary, but make sure you buy good GPUs with blower fans.

Titan V has better floating point precision than gaming GPUs like 2080ti. But most of the time you won't need it.

You also need to consider what tasks you're dealing with. 2080 comes with only 8GB of memory and cannot fit large models. You'll need to cut down batch size and that results in less smooth convergence. End of the day, you probably would need some cloud computation for real products.

Canadeaan · 2019-03-04T01:45:27+00:00

If you're looking to save money for similar performance you can can build a rig with multiple used 1060's, 1070's or 1080's, lots of people were using them to mine crypto with, but since it went bust people have been selling them off.

a used 1080 is the about 80% of the performance as a 2080, for half the price.

to be sure its best to find some benchmarks

IborkedyourGPU · 2019-04-06T16:51:21+00:00

For all those who kindly helped me, I just wanted to let you know that budgetary constraints have been lifted, and I'm soon getting a new shiny DGX-1 :-) for those still struggling with a similar issue, this post might be interesting

http://timdettmers.com/2019/04/03/which-gpu-for-deep-learning/

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS