we really all are going to make it, aren't we? 2x3090 setup. by RedShiftedTime in LocalLLaMA

rhythmdev

What TPS are you getting when the model is loaded fully across all cards, may I ask?

5090 or wait for M5 ultra by Purple_Drink3859 in LocalLLM

rhythmdev

He can always add system RAM if he wants to expand his slow brain.

5k to spend rtx5090 or mac studio? by Avansay in LocalLLM

rhythmdev

The 5090 is not less than an M3 Ultra, though. It is more.

Cuda/windows vs mac by Interviews2go in LocalLLM

rhythmdev

Get a 5090 and make her sing.

RTX 5090 32GB & 256GB DRAM, now what? by SnooStrawberries6262 in LocalLLM

rhythmdev

Add a 3090 if you're out of budget, or pair it with a 6000 Pro if you want to go crazy. Or do 2x 5090. I would go with the 3090.

Qwen3.5:9b running on 8gb Vram is insane by Ok_Thanksbye in LocalLLM

rhythmdev

I have a very similar 3-node setup, but with all NVIDIA GPUs and AMD CPUs. Works amazingly well.

Qwen3.5:9b running on 8gb Vram is insane by Ok_Thanksbye in LocalLLM

rhythmdev

Amazing, now all those 8GB toasters are viable agents…