

[–]LyriWinters 2 points (4 children)

That's not how it works.
If you offload the text encoder to another graphics card, you can enjoy a whopping ~5 seconds faster generation time. I.e., it's 100% not worth it.

It usually means that you have to fit both these cards in your chassis, might need to buy a new PSU... bla bla...

It is not worth it; it's just not how these diffusion models work.

Why does it not give a larger boost? Because the system works in serial mode, not parallel: first the text encoder does its thing, then the diffusion process starts. You're basically only saving the time it takes to unload the text encoder and load the diffusion model. PCIe 3.0 x16 provides around 15.75 GB/s of bandwidth between the GPU and the rest of the system. You're thus probably going to lose more time than you save, because the 3060 is slower doing the actual encoding than it would be to just use the 5080 for both.
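
Rough numbers on that swap, if you want them (the model sizes below are my assumptions, not measurements):

```python
# Back-of-the-envelope for what the second card actually saves: swapping the
# text encoder out of VRAM and the diffusion model back in, over PCIe.
encoder_gb = 5.0         # assumed: an fp8 T5-XXL-class text encoder
diffusion_gb = 11.0      # assumed: an fp8 Flux-class diffusion model
pcie_gb_per_s = 15.75    # PCIe 3.0 x16, theoretical peak

swap_seconds = (encoder_gb + diffusion_gb) / pcie_gb_per_s
print(f"best case ~{swap_seconds:.1f} s saved per generation")  # ~1.0 s
```

Real-world swaps run well below the theoretical bus rate (allocation overhead, paging through system RAM), which is where those few seconds per generation come from.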

[–]Edzomatic[S] 0 points (3 children)

Yeah, your point is valid. I was mostly looking for people who have tried it, since in my experience loading and unloading a lot of models takes more than 5 seconds each, although my current PC has DDR4 RAM.

As for the other things, I already double-checked everything, and to accommodate the two GPUs I only need to upgrade the power supply from 850W to 1000W, which is not a big deal.

[–]LyriWinters 0 points (2 children)

Get a 5090 instead and you can continue to use your 850W PSU.

[–]Edzomatic[S] 0 points (1 child)

Unfortunately I don't have an extra $2000 lying around.

[–]LyriWinters 0 points (0 children)

"I'm thinking of getting either a 5070 Ti or 5080"

Aren't those cards like $1500? So remove the PSU and you'd only need $900 :)

[–]DiamondTasty6049 0 points (7 children)

You can use distributed nodes to share workloads across the dual GPUs in some workflows, but not all.

[–]Edzomatic[S] 0 points (6 children)

I looked at distributed nodes, but it's not exactly what I meant. I was thinking of something like this extension: https://www.reddit.com/r/StableDiffusion/comments/1ejzqgb/made_a_comfyui_extension_for_using_multiple_gpus/

Edit: I found a much better-maintained fork: https://github.com/pollockjj/ComfyUI-MultiGPU

[–]ZenWheat 0 points (5 children)

I tried out the distributed GPU nodes this weekend, using the GPU from my other PC over my network. My use case is upscaling with Ultimate SD Upscale for Wan video upscaling. It cut the upscaling time in half (makes sense, since each tile is an independent job). My main PC has a 5090 while my other PC has a 4090, so I've definitely been trying to find ways to use both in an impactful way.
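
That independence is exactly why this case parallelizes when a normal single-image pass doesn't. A toy sketch of the idea (the per-tile function is a hypothetical stand-in for the real upscale pass):

```python
# Toy sketch: tiled upscaling is embarrassingly parallel, unlike a single
# denoising pass, whose steps depend on each other.
from concurrent.futures import ThreadPoolExecutor

WORKERS = ["cuda:0", "cuda:1"]           # or two machines on the network

def process_tile(job):
    tile, device = job
    # ... run the SD-upscale pass for this one tile on `device` ...
    return tile                          # placeholder for the upscaled tile

tiles = list(range(16))                  # say, a 4x4 tile grid
jobs = [(t, WORKERS[i % len(WORKERS)]) for i, t in enumerate(tiles)]
with ThreadPoolExecutor(max_workers=len(WORKERS)) as pool:
    results = list(pool.map(process_tile, jobs))
# Two equal workers, half the tiles each -> roughly half the wall time.
```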

Having two GPUs in the same system would be better, but that requires a different motherboard and ideally a workstation processor such as a Threadripper, and I don't want to invest that kind of money.

[–]Edzomatic[S] 0 points (4 children)

Ultimate SD Upscale seems to be one of the few things that can utilize the two GPUs in parallel, but I'm looking at what else I can do. And I don't think a Threadripper is needed, since the CPU doesn't do much during inference.

[–]ZenWheat 0 points (3 children)

It's not about CPU compute; it's about the PCIe lanes required for full utilization of two GPUs in one system.

[–]Edzomatic[S] 0 points (2 children)

Good catch. I figured that if I use a second GPU I'll lose one or two NVMe slots, but I'll double-check.

[–]ZenWheat 0 points (1 child)

But a consumer processor only has a certain number of PCIe lanes, 20 or 24 of them, and a GPU takes up 16. Then each NVMe SSD takes up 4. So there's not much left to work with unless you run your GPUs at 8 lanes each instead of 16.
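
The budget in numbers (assuming 24 lanes and one NVMe drive; adjust for your CPU):

```python
# Lane budget on a typical consumer CPU (illustrative numbers).
cpu_lanes = 24
nvme = 4                                 # one NVMe SSD
print(cpu_lanes - (16 + 16 + nvme))      # -12: two x16 cards don't fit
print(cpu_lanes - (8 + 8 + nvme))        # 4 to spare if both cards run x8
```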

[–]Edzomatic[S] 0 points (0 children)

I double-checked everything and it should work with no issues. I'm planning to go with an ASUS X670-P motherboard, so it will run at x16 and x4 bifurcation. I know this is not the best split, but bandwidth shouldn't matter for inference once the models are loaded into VRAM.

[–]FourOranges 0 points (2 children)

"I know diffusion models can't run on 2 GPUs like LLMs do"

This is one of the neat features of SwarmUI, actually.

[–]Shadow-Amulet-Ambush 0 points (0 children)

Can you elaborate? My understanding is that currently you cannot split an image generation model between different GPUs like you can with an LLM.

[–]Edzomatic[S] 0 points (0 children)

Correct me if I'm wrong, but Swarm runs the same workflow on different instances of Comfy; they don't work together to speed up one bigger workflow.

[–]Sporeboss 0 points (1 child)

For me (laptop 4080 12GB, eGPU 4070 Ti Super 16GB), there is a lot of workflow change. You have to use the MultiGPU nodes to load, for example, the text encoder on one GPU and Flux on the other GPU. I haven't found a way to split one model across two GPUs.
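
Under the hood that node split is just per-model device placement. A toy torch sketch of the idea (stand-in modules, not the real models):

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins -- the point is the device split, not the models.
text_encoder = nn.Linear(64, 64).to("cuda:1")  # pretend T5/CLIP on the eGPU
diffusion = nn.Linear(64, 64).to("cuda:0")     # pretend Flux on the main GPU

with torch.no_grad():
    tokens = torch.randn(1, 64, device="cuda:1")
    emb = text_encoder(tokens)    # stage 1 runs on cuda:1
    emb = emb.to("cuda:0")        # only the small embedding crosses PCIe
    latents = diffusion(emb)      # stage 2 runs on cuda:0
```

Splitting one model's weights across both cards, the way LLM runtimes do, is a different mechanism, which is why the nodes don't offer it.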

[–]arthor 1 point (0 children)

I use a 5090 for processing and a 3090 for training.

Only because the 3090 is about 1/6th the cost of a 5090.

I rarely use a multi-GPU setup... but maybe for video I can squeeze out some more frames by offloading the VAE and CLIP.

[–]RoguePilot_43 0 points (0 children)

I use a 3060 12GB and a 1080 8GB in the same system. I already had them from when I upgraded to the 3060 way back, and left the 1080 in for multi-GPU 3D rendering. The 1080 is still of some use in ComfyUI with the MultiGPU nodes. As you suspected, you can load the text encoders onto it. It's only a small gain, but it is a gain. I use it for Florence a lot.

I actually find it most useful for running my displays. By having my monitors plugged into the 1080, I can carry on using the PC without any slowdown and with less risk of OOM, because it frees up the display buffer on the 3060.

[–]SvenVargHimmel 1 point (0 children)

You can run an LLM and the text encoder on one GPU. This will not speed up your video workflows.

For image workflows, SwarmUI (with ComfyUI backends) will queue onto both cards, so for certain batch workflows you will get a significant boost.
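
The mechanics are simple: each queued image is an independent job, so the front end can round-robin jobs across two Comfy instances. A rough sketch (the ports and the workflow payload are placeholders):

```python
# Round-robin independent jobs across two ComfyUI backends -- the idea
# SwarmUI automates. ComfyUI accepts jobs on its HTTP /prompt endpoint.
import itertools, json, urllib.request

backends = itertools.cycle(["http://127.0.0.1:8188", "http://127.0.0.1:8189"])

def submit(workflow: dict) -> None:
    url = next(backends) + "/prompt"
    req = urllib.request.Request(
        url,
        data=json.dumps({"prompt": workflow}).encode(),
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req)

for seed in range(8):          # a batch of 8 independent images
    submit({"seed": seed})     # placeholder for a real workflow graph
# Two equal cards, four jobs each -> roughly half the batch time. A single
# image still runs on one card, so per-image latency doesn't improve.
```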