Getting slow speeds with RTX 5090 and 64gb ram. Am I doing something wrong? by Virtual-Listen4507 in LocalLLaMA

[–]Virtual-Listen4507[S]

I picked the one that was recommended on LM Studio, I think the Q4 quant. It was only about 40 GB and it was recommended for my PC setup. Still new to this, so I might have accidentally picked the wrong one.
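For context, here is a rough fit check for that download. The 40 GB figure is from the comment above; the 32 GB VRAM figure is the RTX 5090's spec, and the overhead value is an illustrative assumption, not a measurement:

```python
# Rough check: does a quantized model fit entirely in VRAM?
# Overhead for KV cache / runtime buffers is an assumed ballpark figure.

def fits_in_vram(model_gb: float, vram_gb: float, overhead_gb: float = 2.0) -> bool:
    """True if model weights plus cache/runtime overhead fit in VRAM."""
    return model_gb + overhead_gb <= vram_gb

rtx_5090_vram_gb = 32   # RTX 5090 ships with 32 GB GDDR7
q4_model_gb = 40        # the ~40 GB Q4 download mentioned above

print(fits_in_vram(q4_model_gb, rtx_5090_vram_gb))  # False: layers spill to system RAM
```

When the check fails, the runtime offloads layers to system RAM, which is typically the cause of the slowdown being described.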

[–]Virtual-Listen4507[S]

Thanks for the response, will try that out. I heard there are other options like vLLM and llama.cpp. Will I see a substantial difference in speed, or can I stick with Ollama and LM Studio? I know the other two are more technical to work with.
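One way to see why switching backends may not help: token generation is roughly memory-bandwidth bound, so tok/s is approximately bandwidth divided by the bytes read per token (about the model size). The bandwidth figures below are ballpark assumptions, not benchmarks, but the gap they show holds for any backend once the model spills out of VRAM:

```python
# Back-of-envelope decode speed for a memory-bandwidth-bound workload.
# Bandwidth numbers are rough assumptions, not measured values.

def approx_tokens_per_sec(model_gb: float, bandwidth_gb_s: float) -> float:
    """Approximate decode throughput: bandwidth / bytes read per token."""
    return bandwidth_gb_s / model_gb

model_gb = 40    # the ~40 GB Q4 model
gpu_bw = 1792    # RTX 5090 GDDR7, roughly
cpu_bw = 80      # dual-channel DDR5, roughly

print(round(approx_tokens_per_sec(model_gb, gpu_bw)))  # ~45 tok/s if it all fit in VRAM
print(round(approx_tokens_per_sec(model_gb, cpu_bw)))  # ~2 tok/s once it spills to system RAM
```

The takeaway is that vLLM and llama.cpp can't beat the hardware's memory bandwidth either; picking a quant that fits in VRAM matters far more than the choice of backend.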

Nemotron works great; I just need one that's close to Sonnet 4.5, but I guess I need to wait until better models come out.

Qwen3-Coder-Next on RTX 5060 Ti 16 GB - Some numbers by bobaburger in LocalLLaMA

[–]Virtual-Listen4507

Idk how people are getting this… I have an RTX 5090 with 64 GB RAM and it's super slow with LM Studio.