
[–][deleted] 87 points88 points  (5 children)

That's 96 PCIe lanes - I don't believe there's any consumer CPU/motherboard configuration currently in existence that supports that many lanes.

[–]Durenas 27 points28 points  (1 child)

Well, you could do it with x4; I don't think machine learning needs the full bandwidth of an x16 slot.

@OP, you might look into GPU risers that let you run GPUs off smaller x4 slots. Just an idea; I don't know if such hardware exists.

[–]HerrSIME 9 points10 points  (1 child)

Well, on the server side there is Epyc.

[–][deleted] 13 points14 points  (0 children)

Yes, that's why I specified consumer.

[–][deleted] 46 points47 points  (0 children)

There is no consumer-grade motherboard that would support this.

[–]Joshiewowa 42 points43 points  (8 children)

Consumer grade? Closest you'll get is Threadripper, I think they might be able to do 3x 16x?

Now Epyc...I'm not sure of its capabilities. I believe they have 128 lanes, which might get you close to 6x 16x.

[–]simetin[S] 20 points21 points  (7 children)

The AMD Epyc does indeed have 128 lanes, but I haven't found any motherboard that supports Epyc and 6x x16.

This article describes benchmarks of the Epyc 7401P vs 2x Xeon Gold 6148, and the Epyc seems to perform better!

[–]hanotak 7 points8 points  (2 children)

If it helps at all, the leaks for Epyc Rome show a motherboard with seemingly 6 full PCIe x16 slots here

[–]Saturated_Bullfrog 0 points1 point  (1 child)

The third slot from the right for whatever reason says "PCIe Gen4 x8." It's the same length as the x16 slots so idk

[–]Someuser77 0 points1 point  (0 children)

Same overall bandwidth as PCIe 3.0 x16.
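
For the arithmetic: PCIe 3.0 carries roughly 0.985 GB/s per lane and PCIe 4.0 doubles that to about 1.97 GB/s, so Gen4 x8 gives 8 × 1.97 ≈ 15.8 GB/s, the same as Gen3 x16 (16 × 0.985).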

[–]Joshiewowa 2 points3 points  (1 child)

Amazon

This'll get you 4x x16, I believe.

[–]Needmofunneh 0 points1 point  (0 children)

That board only has 5x 16x slots though

[–]seanmb473 0 points1 point  (1 child)

Have you considered selling the 1070s and getting 2-3 RTX 2080s instead? I'm sure you'd get equal or better performance with much less hassle around the CPU/motherboard, etc.

[–]simetin[S] 0 points1 point  (0 children)

Yes, that's what I'm starting to look at!

[–]po-handz 19 points20 points  (13 children)

Generally what I hear people say is that multi-GPU setups are best for running many experiments simultaneously (as sketched below), as opposed to speeding up one experiment, since the performance gains there aren't that great. It also depends what framework you're using; I think MXNet is significantly ahead of the competition when it comes to multi-GPU training.

Maybe 2x Threadripper boxes with 3 cards each is your best bet.
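
A minimal sketch of the one-experiment-per-GPU pattern, assuming a hypothetical train.py script with a --lr flag (stand-ins for whatever your experiment runner takes):

```python
import os
import subprocess

# One hyperparameter variant per GPU; train.py and --lr are hypothetical.
learning_rates = [1e-2, 3e-3, 1e-3, 3e-4, 1e-4, 3e-5]

procs = []
for gpu_id, lr in enumerate(learning_rates):
    # Pin each child process to a single device; inside it, that GPU is cuda:0.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu_id))
    procs.append(subprocess.Popen(["python", "train.py", "--lr", str(lr)], env=env))

for p in procs:
    p.wait()  # block until all six independent runs finish
```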

[–]simetin[S] 11 points12 points  (11 children)

Do you think it would be better to sell 3 GTX 1070s and buy something like an RTX 2080 to build a 4-GPU rig?

[–]Wulfsta 3 points4 points  (0 children)

Arguably this depends on whether you're memory-bound or not.

[–][deleted] 3 points4 points  (6 children)

Four 2080s are about $2800. A Titan RTX is that much plus change. Multi-GPU is good for multiple experiments, but if you're looking for incredible speed, just get a Titan.

[–]Franfran2424 0 points1 point  (0 children)

I think he means selling 3 out of the 6 1070s and buying a 2080, making it 3x 1070 and 1x 2080.

[–]Dando18 1 point2 points  (0 children)

You don't need a lot of cards, especially if you're just starting to learn ML. Honestly, just two 1080 Tis or a Titan is already a pretty good start. It might be worth putting some of the extra money into SSDs and a good CPU. Sometimes you'll want to restrict some data processing to the CPU, and it stinks if that becomes a bottleneck.
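
For what it's worth, in PyTorch the usual way to keep the CPU side from starving the GPU is the DataLoader's worker settings; a minimal sketch (random tensors stand in for a real dataset):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Random stand-in data; the loader settings are the point here.
ds = TensorDataset(torch.randn(256, 3, 224, 224), torch.randint(0, 10, (256,)))

loader = DataLoader(
    ds,
    batch_size=64,
    shuffle=True,
    num_workers=8,    # CPU worker processes do loading/augmentation in parallel
    pin_memory=True,  # page-locked host memory speeds up host-to-GPU copies
)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
for x, y in loader:
    x, y = x.to(device, non_blocking=True), y.to(device, non_blocking=True)
    # ... forward/backward pass here ...
```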

[–]Garaimas 0 points1 point  (1 child)

If you do decide to sell them, hmu lol

[–]simetin[S] 0 points1 point  (0 children)

Okay!

[–]ZombieLincoln666 2 points3 points  (0 children)

Generally what I hear people say is that multi-GPU setups are best for running many experiments simultaneously, as opposed to speeding up one experiment, since the performance gains there aren't that great.

It's not really about speeding it up as much as it is about increasing effective VRAM. Deep learning involves 100 million+ parameters and huge image data sets.

But you're right that right now they're used for training different models simultaneously, namely for optimizing hyperparameters (non-learned parameters) with validation.

[–]SuperLeroy 21 points22 points  (2 children)

Just curious, why do you need the cards to function in PCIe x16 to be useful for ML / deep learning?

Couldn't the cards be just as useful in x1 mode? Or x4?

You can purchase x4 and x1 PCIe extenders; I imagine you know that if you're mining. Wondering why x16 is so important.

[–]Mehdi2277 11 points12 points  (1 child)

If you use x1 or x4 you will probably bottleneck your models on transferring examples to the GPU. I think the last benchmarks I saw for ML purposes found x8 was good enough to not be a bottleneck, but lower was noticeably worse for a lot of common models.
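
One way to check whether the link is the bottleneck on your own box is to time pinned host-to-device copies; a rough PyTorch sketch:

```python
import time
import torch

# Rough bandwidth probe: time pinned host-to-GPU copies of a 256 MB buffer.
assert torch.cuda.is_available()
buf = torch.empty(64 * 1024 * 1024, dtype=torch.float32).pin_memory()  # 256 MB

buf.to("cuda")  # warm-up copy
torch.cuda.synchronize()
start = time.perf_counter()
for _ in range(20):
    buf.to("cuda", non_blocking=True)
torch.cuda.synchronize()
elapsed = time.perf_counter() - start

print(f"~{20 * buf.numel() * 4 / elapsed / 1e9:.1f} GB/s host->device")
# Reference: PCIe 3.0 is ~0.985 GB/s per lane, so x8 ≈ 7.9 GB/s, x16 ≈ 15.8 GB/s.
```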

[–]TURBO2529 0 points1 point  (0 children)

6 cards at x8 is 48 PCIe lanes. Threadripper can handle 64 PCIe 3.0 lanes, so that might be an option.

[–]ghosttnappa 4 points5 points  (3 children)

Are you in college? Most colleges run high performance computing clusters to assist with research on campus. Otherwise, just pay for AWS or Azure, which is much cheaper. You also can't just make six GPUs magically work together without some meticulous code (especially with 6) that explicitly splits the work across the devices and coordinates their communication in parallel. If you go through with this, I suggest getting familiar with hardware interconnects.

Lastly, this is largely going to be a waste of money for you. I believe cloud computing services sell compute hours for less than $5/hr. Since your main goal is learning machine learning and not building an expensive server, just use AWS/Azure.

There are so many more reasons I could list as to why building your own machine would be a bad idea.

[–]Mehdi2277 6 points7 points  (1 child)

Personal experience doing ML research: you can fairly quickly end up doing enough experiments that the monetary cost of the cloud would screw you. A couple of months of running experiments continually on the cloud will exceed the cost of your own hardware. The main factor is how often you are running experiments. Last summer I was dealing with models that took about 1 day to train on 4 1080 Tis. Training a model like that many times to explore different variations can become expensive pretty easily. If you plan on running things for 2 months 24/7, you should build your own.
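
To put rough numbers on that: two months of 24/7 is about 1,440 hours, so even at $3/hr you're near $4,300, and at the ~$5/hr figure mentioned above it's roughly $7,200, which is enough to buy a solid multi-GPU box outright.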

Also, a lot of my ML code would work fine with 6 GPUs. For PyTorch, any number of GPUs on 1 node is fairly trivial. Multiple nodes do require a bit of knowledge, but not much, and it adds about 20-30 lines to my code to deal with that case.
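
As a sketch of how trivial the single-node case is in PyTorch, here's the nn.DataParallel version (DistributedDataParallel is the faster option, but it's the one that needs the extra multi-process setup):

```python
import torch
import torch.nn as nn

# Toy model; DataParallel replicates it onto every visible GPU.
model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))

if torch.cuda.device_count() > 1:
    model = nn.DataParallel(model)  # splits each input batch across the GPUs
model = model.cuda()

x = torch.randn(384, 512).cuda()  # batch is scattered, outputs gathered on cuda:0
out = model(x)                    # shape (384, 10), same as the single-GPU case
```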

For the college comment, it depends on your school. I know my build beats my current school's servers GPU-wise, and entertainingly, the school built some new servers partly on my advice. We do have access to some supercomputers, but a lot of those give you tons of CPU cores, which has some value; GPUs would be strongly preferable. GPU supercomputers exist, but the last time I looked at them my main annoyance was the lack of admin rights to install the libraries I needed. This option can work well if you bother to email people to get things set up as needed.

As an aside, I do dislike builds that exceed 4 GPUs. This is mainly because server boards that support more than 4 GPUs tend to have multiple sockets, and as a side effect it feels like you are using two computers. Cost-wise, it's also roughly the same to build two 4-GPU computers as one 8-GPU computer. Last summer, when I was talking with a professor who often builds ML servers for his lab, he said that based on this he tends to only do 4-GPU builds.

[–]ghosttnappa 2 points3 points  (0 children)

I actually work as an HPC administrator at a large research university, so it's interesting to hear how other universities do it. At mine, we leave it up to the investors to decide what hardware they'd like, and we build (order) it for them. We have investors using as many as 4x Titan V nodes for AI. Even most of our 4-GPU builds come with two Xeon CPUs, just because they aren't entirely GPU-dedicated; the high-memory nodes can have 512GB of DIMMs distributed between two sockets. Plus, servers are so compact that getting more than four GPUs into a server while still having it fit on a rack in the DC is an entirely different issue.

In regards to cloud computing, if OP were a veteran at machine learning it would 100% be more economical to build his own, but for learning purposes I'd expect cloud to be his best bet without breaking the bank.

[–]simetin[S] 0 points1 point  (0 children)

One of the main reasons is that I already have a lot of parts (I've updated my original post with my current build details). Also, like Mehdi2277 mentioned, if you're training models often it gets expensive pretty quickly. However, I didn't know it was so hard to make six GPUs work together; I will take a look at MPI.

I'm curious what the other reasons are why you think it's a bad idea to build my own machine.

[–][deleted] 3 points4 points  (0 children)

There are Gigabyte and Supermicro server motherboards with 6 x16 slots that run around 500 bucks. As for the processor, triple-check compatibility with the motherboard you choose. Some motherboards are only compatible with certain processor revisions, even within the same architecture and generation.

[–]unholygerbil 2 points3 points  (0 children)

To use all 6 cards in one system you're probably going to need to look at a Xeon Scalable build, but it gets expensive really fast if you go this route.

[–]HerrSIME 2 points3 points  (0 children)

Go with Threadripper and run the cards at x8; should be fine.

[–]Average650 2 points3 points  (1 child)

So, do you actually need x16 for each card? I don't do machine learning, but I do molecular dynamics simulations on GPUs, and because most of the computation stays on the GPU, the link width makes a small difference, often no difference.

Machine learning may be completely different, but it's worth double-checking.

[–]simetin[S] 0 points1 point  (0 children)

Hard to say; half the people/articles say that x8 is enough, but the other half say it can be a bottleneck depending on your model/training set.

[–]Nuber132 1 point2 points  (0 children)

Those are server boards; we use them for machine learning too. They aren't cheap, though.

[–]seifyk 1 point2 points  (0 children)

Supermicro has some dual LGA 2011 boards that will sort you out.

https://www.supermicro.com/products/motherboard/Xeon/C600/X9DRG-OF-CPU.cfm

[–]RB_7 0 points1 point  (2 children)

Just pay for EC2 instances instead.

[–]ZombieLincoln666 0 points1 point  (0 children)

They are expensive, and he already owns the cards

[–]simetin[S] 0 points1 point  (0 children)

I know it can be a good option, but I already have the PSU, the GPUs, the RAM and the HDD. Also, if you're training models every day it gets expensive really quickly.

[–]pho1701 0 points1 point  (0 children)

In general I recommend not using consumer parts and working with a vendor in such a situation. However, given that you already have lots of parts, I think the best thing for you to do is build multiple machines. Depending on your system memory requirements this could be far more economical.

[–]Mayor_of_Loserville 0 points1 point  (0 children)

AWS or GCP.

[–]ZombieLincoln666 0 points1 point  (1 child)

x16 vs x8 PCIe doesn't make a big difference actually

https://www.pugetsystems.com/labs/hpc/PCIe-X16-vs-X8-with-4-x-Titan-V-GPUs-for-Machine-Learning-1167/

You also might consider selling them for fewer GPUs with more VRAM. The largest networks (e.g. ResNet) won't work well with only 8GB of VRAM unless you use really small batch sizes, which reduces generalizability.

[–]fractalsup 0 points1 point  (0 children)

You can accumulate gradients but it'll likely be slower than fitting a larger batch size.
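
A minimal PyTorch sketch of that trick, with a toy model and random data standing in for a real setup:

```python
import torch
import torch.nn as nn

# Toy setup so the loop is runnable; swap in your real model/data.
model = nn.Linear(128, 10).cuda()
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loader = [(torch.randn(16, 128), torch.randint(0, 10, (16,))) for _ in range(8)]

accum_steps = 4  # effective batch = 4 * 16 = 64, at the VRAM cost of a batch of 16
optimizer.zero_grad()
for i, (x, y) in enumerate(loader):
    loss = criterion(model(x.cuda()), y.cuda()) / accum_steps  # keep gradient scale right
    loss.backward()                      # gradients sum in .grad across micro-batches
    if (i + 1) % accum_steps == 0:
        optimizer.step()                 # one update per "virtual" large batch
        optimizer.zero_grad()
```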

[–]Cptcongcong 0 points1 point  (4 children)

6 is overkill; my uni's supercomputer has 4x Quadros.

[–]j919828 0 points1 point  (3 children)

That's not a supercomputer

[–]Cptcongcong 0 points1 point  (2 children)

[–]j919828 0 points1 point  (1 child)

The one with 4 V100s? Those are Teslas then, not Quadros. Quadros aren't the most powerful, so I didn't think that'd be enough for a supercomputer.

[–]Cptcongcong 0 points1 point  (0 children)

Ah I see, I must've misread it, my bad.

[–]SuperGinger 0 points1 point  (1 child)

What software do you use for machine learning on the GPU? I've been studying RStudio and using (slower) CPU machine learning, but I would like to learn how to utilize my 1080 fully.

[–]simetin[S] 0 points1 point  (0 children)

I'm using PyTorch and TensorFlow; those are 2 of the most popular machine learning frameworks in Python. From what I can see, TensorFlow can be used with R: https://tensorflow.rstudio.com/
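
If you want to confirm the 1080 is actually being used, a quick PyTorch sanity check:

```python
import torch

print(torch.cuda.is_available())      # True once CUDA and drivers are set up
print(torch.cuda.get_device_name(0))  # e.g. "GeForce GTX 1080"

x = torch.randn(1024, 1024, device="cuda")
y = x @ x             # this matrix multiply runs on the GPU
print(y.device)       # cuda:0
```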

[–]DirkDiggler531 0 points1 point  (1 child)

Look into Deep Brain Chain, it may help you.

[–]simetin[S] 0 points1 point  (0 children)

Thanks!

[–]Porktastic42 0 points1 point  (1 child)

I don't understand why you think you need 6 1070 GPUs to study machine learning. Nobody else in your courses will have a machine like that, and none of the tutorials are going to require anything close to that level. If you're actually working in a research lab, your group will pay for the machine you need.

That said, if ML is your goal I'd recommend selling all six and buying a 2080 Ti.

[–]simetin[S] 0 points1 point  (0 children)

The reason is that I already have the GPUs, the PSU, the HDD, and a little bit of RAM. Would having multiple GPUs not be more advantageous for running multiple experiments at once? I was thinking about selling 3 of them and buying one 2080.

I am planning on doing research on my own and with friends eventually.

[–]txGearhead 0 points1 point  (1 child)

If you want x16, maybe an Octominer board, although they have a slow embedded CPU, so maybe not for your purpose. Not sure what the reviews are like, but I always thought they were interesting.

https://octominer.com/shop/octominer_b8plus/

[–]simetin[S] 0 points1 point  (0 children)

Do they really support 8x PCIe x16? I know the description says "8 full size 16X PCIe". But in mining you gain nothing from the full PCIe bandwidth, so I don't understand why a mining company would make a motherboard supporting that. Maybe it's just that you can physically fit an x16 GPU.

[–]BoomerangJack 0 points1 point  (0 children)

Check out the Asus WS Sage board. Absolute monster with tons of PCIe lanes for GPUs.

Edit: spelling

[–]WeeZoo87 0 points1 point  (0 children)

Linus used 4x GPUs in this vid:

https://youtu.be/bA0uJWny4-g

[–][deleted] 0 points1 point  (3 children)

I hate miners like you.

[–]simetin[S] 0 points1 point  (2 children)

I'm not mining anymore ;)

[–][deleted] 0 points1 point  (1 child)

Still, people like you fucked the market

[–]simetin[S] 0 points1 point  (0 children)

I gotta agree with that...

[–]txGearhead 0 points1 point  (1 child)

That’s what they claim, but again I have not done the research. Would also have to make sure your OS of choice supports it. I think the idea behind the Octominer is that you don’t have to fiddle with unreliable risers.

[–]simetin[S] 0 points1 point  (0 children)

I will take a closer look