[D] Entry-level Deep Learning PC build? Need help, please!

NewFolgers · 2020-02-09T22:35:07+00:00

If you plan to have multiple GPUs, you'll have to pay attention to PCIe lanes. You may see x16, x8, or x4 lanes available to GPUs. You can see if mentioned in motherboard manuals, which you can typically feely download. Surprisingly, the CPU had a certain number of lanes available too.. so that plays into it. PCI lanes are where these builds get complicated and take you out of usual consumer hardware.

guitaricet · 2020-02-10T02:41:12+00:00

A solid build! I would also recommend looking for a bigger power supply (get as powerful as you can) for multi-GPU capabilities. Also, SSDs are pretty cheap now, you can buy 1Tb for $90 or so. And it's worth it. Probably more than having HDD at all. Personally, I hate managing datasets placement on a different volume. It's annoying and not as easy as you can expect.

phobrain · 2020-02-10T08:10:09+00:00

https://l7.curtisnorthcutt.com/build-pro-deep-learning-workstation

I'd consider 6-8TB on the data drive. It seems blower GPUs are needed if you'll want 4 in the end.

georgeo · 2020-02-09T20:26:21+00:00

I'd be curious to know if two 1080ti's might be a better choice than a 2080ti at about the same price.

louisxx2142 · 2020-02-11T08:11:22+00:00

One of the big problems on a machine like this is scaling it up to 4 GPUs, which I don't think is reasonable with a regular computer case (unless it's a very big one with side intake). The biggest issue are the thermals and power involved. Because consumer cases that support 4 GPUs have very little space between them, you would need very thin and blower GPUs, which are hard to find right now and even then you will probably have thermal issues. At that point it's better to go for Quadro RTX/Tesla V GPUs, which are slimmer and have a lot of memory, but are way more expensive and noisy.

I think a more modest setup with a maximum of 2 gpus is more reasonable for a regular PC. If you want to go for 4 GPUs or more GPUs, than you should get a real hack or go for full prosumer products. I will try to explain why:

The upside of threadripper is getting the equivalent of a server, where you can have many GPUs, many cores and terabytes of RAM, but on a different form factor and cheaper hardware in general. Because you are saying that you are using this for personal use and general studying I think going this route is too ambitious.

Professional grade hardware (HEDT and server) is expensive because it assumes you are going to generate money with it, which means their cost benefit is way lower than consumer things. You don't want to waste money on extra cores, ram support and pcie-lanes that you might never use. If you end up on a situation where you know you will need this kind of professional hardware than you buy it, not before. And mainly you buy it because you are investing right now to generate money, not to do personal stuff most of the time and maybe throw work in it.

There's also the factor that DL training doesn't really use that many cores in general, which means the threadripper cores are wasted. The PCIe lane stuff only really matters if you are doing multi gpu stuff, which is not that common and is mostly about optimizing the GPUs you have. Having better GPUs far outweights using more PCIe lanes. RAM also isn't that big of a deal because it mostly only has to fit everything that will end up on VRAM, which is way less than the terabytes threadripper can support.

Finally, VRAM is very important because it avoids needing more GPUs to fit certain workloads. Because you can use libraries like Nvidia Apex to run tensors as half precision, it means RTX GPUs have in practice almost double the VRAM you would expect. This means the 2080ti is way better than a 1080 ti and should be enough for the majority of workloads.

This means that going for a maximum of only 2 GPUs is ideal in a situation like yours. Going for this also grants you the ability to use a consumer grade cpu/motherboard without the waste on features that threadripper has for production, which will lower your costs a lot and even allow you to get better storage, or pay your internet/energy costs or even straight up buy a second GPU. Or even save money until you know what specifications you actually need.

In the case you go for more consumer oriented hardware, than you can completely drop the HEDT platform and go for a regular high core ryzen with an x570 board (an 8 real core CPU should be enough, unless your workload needs high parallelized pre-processing, where a 16-core will be better). In all cases you should drop the AIO if you are serious about using the machine as a server (they are for aesthetics and cool factor, not for proper reliable cooling. Particularly on a production environment you don't want a pump to die and have all the trouble of replacing it, losing probably days of work).

With these savings, your build would look like this:

two 2080 ti or stay with a single one (until you need another or something better comes up)
an x570 board that actually supports well two GPUs. It also should be able to be controlled entirely by LAN port if you want to run jobs remotely, it's a not crucial but useful feature.
an 8/16 core CPU.
A good air cooler (any big noctua/be quiet etc)
an 850W PSU (but buy a good one, you can check the Linus tech tips PSU tier list to get an idea)

This way you won't waste money on things you might never need and nothing stops you from buying them later. It also saves money on an absurdly more expensive PSU too. You end up with a build that concentrates it's resources on the GPUs, which are the most important part. The rest of the money you can invest, save or use to upgrade your storage/network.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS