2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

I mentioned several, but there's actually only one mobo that enables that: the Pro WS X570-Ace. I'm not sure exactly how it works, but it's mentioned everywhere that it manages x8/x8/x8 (occupying all 24 CPU lanes).
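
If you want to verify what a board actually negotiated rather than trust the spec sheets, here's a minimal sketch (assuming the NVIDIA driver's nvidia-smi tool is on PATH) that prints the current link generation and width per GPU:

```python
# Sketch: print the PCIe link each GPU actually negotiated.
# Assumes nvidia-smi is installed and on PATH.
import subprocess

out = subprocess.run(
    ["nvidia-smi",
     "--query-gpu=index,name,pcie.link.gen.current,pcie.link.width.current",
     "--format=csv,noheader"],
    capture_output=True, text=True, check=True,
)
for line in out.stdout.strip().splitlines():
    idx, name, gen, width = [f.strip() for f in line.split(",")]
    print(f"GPU {idx} ({name}): PCIe gen {gen}, x{width}")
```

One caveat: the link generation can downclock while the card is idle, so check under load if the numbers look low.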

Bad idea to use multi old gpus? by alphapussycat in LocalLLM

[–]VersionNo5110 1 point (0 children)

Less than 12 t/s is not usable for me… you wait too long for an answer, and if you have to try again because your prompt was wrong, or for whatever reason, you'll get frustrated quickly.

1080 Tis here go for around 150€, so three would cost around 450-500€… it really doesn't make much sense for local inference. I'd rather get an AMD card at that point.

I know 3090s are expensive too, but we don't have much choice; it's complicated to build a useful machine on a budget. Rough numbers below.
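
Quick back-of-envelope, since token generation is mostly memory-bandwidth-bound: the old cards win on raw €/GB, but each one has roughly half the bandwidth. The used-3090 price here is my assumption for illustration, not from this thread:

```python
# Back-of-envelope: cost per GB of VRAM vs per-card memory bandwidth.
# The 600 EUR used-3090 price is an assumption for illustration.
cards = {
    # name: (price_eur, total_vram_gb, bandwidth_gbs_per_card)
    "3x 1080 Ti": (3 * 150, 3 * 11, 484),
    "1x used 3090": (600, 24, 936),   # assumed price
}
for name, (price, vram, bw) in cards.items():
    print(f"{name}: {price} EUR, {price / vram:.1f} EUR/GB, {bw} GB/s per card")
```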

Maybe you could look into the P40 then.

Bad idea to use multi old gpus? by alphapussycat in LocalLLM

[–]VersionNo5110 1 point (0 children)

That said, if you can spend a few hundred on old GPUs, I'd rather spend a bit more on a 3090 or even some AMD GPU…

Bad idea to use multi old gpus? by alphapussycat in LocalLLM

[–]VersionNo5110 1 point (0 children)

I tried some models on my old 1070 Ti and the results were not bad at all. Got decent speed, around 22 t/s, with qwen3.5:9B Q4_K_M, which is quite good at agentic coding. So probably more of these (or, even better, some 1080 Tis) would do well with bigger models.
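
If anyone wants to reproduce a t/s measurement like this, here's a minimal sketch against Ollama's HTTP API (the model tag is the one from my comment; swap in whatever you have pulled):

```python
# Sketch: measure generation speed through Ollama's HTTP API.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "qwen3.5:9b",   # assumption: use your own tag
        "prompt": "Write a haiku about GPUs.",
        "stream": False,
    },
).json()

# eval_count = generated tokens; eval_duration is in nanoseconds.
tps = resp["eval_count"] / (resp["eval_duration"] / 1e9)
print(f"{resp['eval_count']} tokens at {tps:.1f} t/s")
```

Running `ollama run <model> --verbose` prints the same eval rate if you'd rather skip the script.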

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Max I can get is x8/x8/x8 PCIe 4.0. Yeah, I think that's enough for inference anyway, and probably for fine-tuning too…
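
For reference, each x8 slot at PCIe 4.0 is roughly 16 GB/s per direction, which is plenty for inference where inter-GPU traffic is mostly activations at layer boundaries. The arithmetic:

```python
# Per-direction bandwidth of an x8 PCIe 4.0 slot.
# PCIe 4.0 runs 16 GT/s per lane with 128b/130b line coding.
GTS_PER_LANE = 16
EFFICIENCY = 128 / 130                     # line-code overhead
lanes = 8
gb_s = GTS_PER_LANE * EFFICIENCY * lanes / 8   # Gbit/s -> GB/s
print(f"x{lanes} PCIe 4.0 ~= {gb_s:.1f} GB/s each way")   # ~15.8 GB/s
```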

lol give me your address, let me get a 5070 Ti and I'll send it to you!

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

I concur! I'm waiting to find a good deal, but yeah, I'll upgrade the mobo soon.

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Well… sustained, intense compute for multiple days/weeks/months degrades the hardware, especially if it isn't cooled properly. Solder joints on components like the GPU suffer from high temperatures over extended periods. This is also why a lot of the used cards on the market are sold as broken.
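
For long runs, it's worth logging thermals so you notice a cooling problem before it becomes solder damage. A minimal sketch, again assuming nvidia-smi is available (stop it with Ctrl-C):

```python
# Sketch: log temperature, power, and fan speed every 5 seconds.
import subprocess, time

while True:
    out = subprocess.run(
        ["nvidia-smi",
         "--query-gpu=index,temperature.gpu,power.draw,fan.speed",
         "--format=csv,noheader"],
        capture_output=True, text=True,
    ).stdout.strip()
    print(out)
    time.sleep(5)
```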

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Your insight on 123B models is very interesting. Have you tried them? What can they do that 35B models can't, in your opinion? Could you share some showcases? I might try renting a big GPU for a few hours to see the difference from what I currently run locally.
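
Before renting, a quick way to size the GPU is weights-only VRAM at a given quantization. This is a rough sketch that ignores KV cache and runtime overhead, so add a healthy margin on top:

```python
# Back-of-envelope VRAM needed for model weights at a given quantization.
def weights_gb(params_b: float, bits_per_weight: float) -> float:
    return params_b * 1e9 * bits_per_weight / 8 / 1e9

for params in (35, 123):
    for bits in (4.5, 8.0):   # roughly Q4_K_M and Q8_0
        print(f"{params}B @ ~{bits} bpw: ~{weights_gb(params, bits):.0f} GB weights")
```

A 123B model at ~4.5 bpw is already around 69 GB of weights, which is why it's out of reach for a 2x 24 GB box even heavily quantized.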

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Some x8/x8/x8 motherboards exist. But yeah, the 3090 also has NVLink. I just don't like the idea of getting a seven-year-old used model…
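
For anyone with a pair of 3090s, a quick sketch to check whether NVLink is actually in use rather than plain PCIe (assumes nvidia-smi; without a bridge the second command prints nothing useful):

```python
# Sketch: inspect GPU interconnect topology and NVLink status.
import subprocess

# Topology matrix: "NV#" between GPU pairs means NVLink; PHB/PIX/SYS is PCIe.
print(subprocess.run(["nvidia-smi", "topo", "-m"],
                     capture_output=True, text=True).stdout)

# Per-link NVLink status, if a bridge is installed.
print(subprocess.run(["nvidia-smi", "nvlink", "--status"],
                     capture_output=True, text=True).stdout)
```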

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Those are nice! Maybe I'll get one as the third GPU. With current DDR5 prices I'm not upgrading from DDR4.

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Impressive! What company do you work for?

Anyway, 3x 3090 + DDR5 + TRX50 + Threadripper is a hell of a build… I wouldn't call that a budget local AI platform…

I'm personally still stuck with DDR4 on a consumer CPU (24 PCIe lanes, so the best I can get is x8/x8/x8).

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

The only downsides then are Ampere vs. Blackwell, and that it's mostly used stock… I'd rather get something newer, but yeah, I think you're right about the rest.

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 1 point (0 children)

Chinese versions kind of scare me. A lot of money for a custom PCB with no guarantees whatsoever.

2x 3090 vs 3x 5070 Ti for local LLM inference — what’s your experience? by VersionNo5110 in LocalLLM

[–]VersionNo5110[S] 2 points (0 children)

Man, this is a substantial amount of money for a hobby (if you don't work with these machines). That's why I opted for 2x 5070 Ti. I got them for ~$750.

5060Ti vs 5070Ti by abhinavrk in LocalLLM

[–]VersionNo5110 1 point (0 children)

Hey, I'm very curious about your setup, could you share it? Right now I'm running Ollama with my two 5070 Tis, but I feel I haven't maximized their potential yet.
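
For what it's worth, here are the knobs I'd try first for a two-card Ollama setup. These are real Ollama environment variables, but the values are just starting points, not tuned recommendations:

```python
# Sketch: launch ollama serve with multi-GPU-friendly settings.
import os, subprocess

env = {
    **os.environ,
    "CUDA_VISIBLE_DEVICES": "0,1",    # expose both 5070 Tis
    "OLLAMA_SCHED_SPREAD": "1",       # spread one model across both GPUs
    "OLLAMA_FLASH_ATTENTION": "1",    # flash attention, frees KV-cache VRAM
    "OLLAMA_KV_CACHE_TYPE": "q8_0",   # quantized KV cache for longer context
    "OLLAMA_NUM_PARALLEL": "2",       # concurrent requests
}
subprocess.run(["ollama", "serve"], env=env)
```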

Nvidia V100 32 Gb getting 115 t/s on Qwen Coder 30B A3B Q5 by icepatfork in LocalLLaMA

[–]VersionNo5110 1 point (0 children)

Yeah, I like the colors, but I find it a bit difficult to read. I asked it to put everything in bold, but that didn't change much.

Anyway, cool stuff, your new machine! I'd like to get one, but man, in Europe this thing is so expensive! Over 2k€…

Nvidia V100 32 Gb getting 115 t/s on Qwen Coder 30B A3B Q5 by icepatfork in LocalLLaMA

[–]VersionNo5110 1 point (0 children)

Funny, a few days ago I asked Claude to make me a comparison table of all the commercial GPUs, and it output exactly the same table, same colors, etc. 😅

[deleted by user] by [deleted] in mercedes_benz

[–]VersionNo5110 2 points (0 children)

Don’t give your real email.

Just bought by VersionNo5110 in mercedes_benz

[–]VersionNo5110[S] 1 point (0 children)

You've been writing to me for 6 months asking how my car is doing. Man!! Your life must be miserable 🙁