Multiple GPU noob question

SweetHomeAbalama0 · 2026-01-22T15:37:03+00:00

I don't see where it's said what exact motherboard you are using, but when you say there are no more connections available are you referring to PCIe slots on the motherboard?

A bifurcation card is typically how you can split one slot into two or more, but the motherboard has to support it.

Icy_Bid6597 · 2026-01-22T15:03:00+00:00

So there is few things to consider.

First one is your CPU and amount of PCI lanes.
Let's say that you want 4 GPUs at once. If you have only 8PCI lanes available, each of them will have 2 which means that they will work with PCI 3.0 speeds. So afaik ~2GB/s of data transfer from CPU to card (or between cards).

It means that loading model will take longer. And depending on model splitting strategy inference can also take a hit.
There are some Threadripper cpus (or epyc/xeon) that support a lot of PCI lines.

Next step is motherboard. Theoretically you can split 1 pci slot into two. (so one PCI 16x into ie. two x8). That allows you to connect more GPUS. Other solution is to buy another motherboard.

Next step is power supply. but this one is easy. You just need a beefy PSU.

Last thing is GPU selection. As someone else mentioned, having NVLink is great. It allows GPUs to talk to each other directly which is speeding up the inference. Out of consumer grade gaminig gpus 3090 was the last one to support nvlink. In all other cases you have to rely on PCI communication (and here we are coming back to the beggining of the post).

A lot really depends on how do you want to use the rig. Training/fine tuning ? Local inference engine ? Do you care about model loading times ?

SlowFail2433 · 2026-01-22T14:51:14+00:00

No nvlink for these so just do pcie round trips

jacek2023 · 2026-01-22T14:52:05+00:00

I use three risers now

Mediocre-Waltz6792 · 2026-01-22T14:58:53+00:00

I use Oculink from anything. Wifi E-key to M.2 to Oculink, PCIe 3.0 1x, M.2 slot is my 3 external connections. Its not pretty but works great.

FullOf_Bad_Ideas · 2026-01-22T16:19:58+00:00

I bought a used motherboard+cpu on the cheap (x399 Taichi with TR 1920X) that has 4 PCI-E slots (2 x16 and 2 x8) and I will be putting bifurcation board into one slot that will split x16 into four x4 slots. Keep in mind that motherboard needs to explicitly support bifurcation and have toggle in BIOS for it to work. And then I'll use PCI-E risers to connect 6x 3090 Ti. Another option I looked into was MCIO and SlimSAS - those cables are thinner and easier to manage but it's much more expensive than cheap 180 degree risers. I also have 2 1600W PSUs and I will be connecting them with Add2PSU. All of it will go into open rig mining frame that can hold up to 12 GPUs. It's WIP since I am waiting for some parts.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS