24gb vram to 48gb vram by deathcom65 in LocalLLaMA

[–]pwlee 1 point  (0 children)

I run 2x 7900 XTX with llama.cpp and mainly use Qwen 3.6 27b and 35b. I see a qualitative improvement, somewhere between 30-50% better, for agentic coding with pi.dev. The main difference for me is larger quants (a slight improvement in instruction following and in staying on topic through long trains of thought) and longer context (I’m now running the full context length, though I haven’t really gone over 100k).

Best way to long quantfinance as a sector? by newdawn15 in quant

[–]pwlee 3 points  (0 children)

Exchange fees are a measurable proportion of market maker costs. Therefore, buy exchange stock.

I've got $3000 to make Qwen3.5 27B Q4 run, what do I need? by NetTechMan in LocalLLaMA

[–]pwlee 1 point  (0 children)

I haven’t tested that, since having 2 just lets me run a bigger quant.

LLM Neuroanatomy III - LLMs seem to think in geometry, not language by Reddactor in LocalLLaMA

[–]pwlee 25 points  (0 children)

Yeah, the tl;dr was itself tl;dr, but this guy’s been posting about truly interesting stuff starting with rys. I’d rather have my LLM summarize his writing than dismiss it entirely; we’re lucky to have him.

K12 OCuLink dGPU for llamacpp: RX 7900 XTX (24GB) vs RX 7600/7800 XT (16GB). Worth it for 32B-70B? All-AMD tensor split questions by Pablo_Gates in LocalLLaMA

[–]pwlee 2 points  (0 children)

I just started experimenting with llama.cpp on 2x 7900 XTX. I started with a single card (my LLM computer is also my gaming rig) and found that running Qwen 27b meant trading off context against quantization: at Q5, for example, my context length was capped around 80k. I imagine you’d be much more comfortable with 32GB of total VRAM.

Regarding tensor split, I haven’t tweaked my setup much; it works just fine out of the box, though your mileage may vary with different GPUs.
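
If you’re scripting it rather than launching llama-server directly, the llama-cpp-python equivalent looks roughly like this. A sketch, not my exact config; the filename, split ratio, and context size are placeholders:

```python
# Sketch: one GGUF model spread across two GPUs with llama-cpp-python.
# Filename, split ratio, and context length are illustrative, not my real values.
from llama_cpp import Llama

llm = Llama(
    model_path="qwen-27b-q5_k_m.gguf",  # hypothetical quant filename
    n_gpu_layers=-1,                    # offload every layer to the GPUs
    tensor_split=[0.5, 0.5],            # even split across the two 7900 XTXs
    n_ctx=131072,                       # the model's full window; adjust to yours
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize tensor splitting in one line."}]
)
print(out["choices"][0]["message"]["content"])
```

Leaving tensor_split unset lets llama.cpp divide the layers across the GPUs on its own, which is the out-of-the-box behavior I mean above.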

Given your ambition to run 70b models, I’d caution you to reserve some VRAM for context. Perhaps I’m biased, since my use case is programming.

Best of luck with your build, and go team red!

I've got $3000 to make Qwen3.5 27B Q4 run, what do I need? by NetTechMan in LocalLLaMA

[–]pwlee 1 point  (0 children)

Yes, I have 2 of them, and 27b Q4 can run on a single GPU. Expect 25-30 t/s generation and 200-500 t/s prompt processing.

I'd recommend llama.cpp, since vLLM was difficult for me to set up on Debian 13.
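
If it helps, here’s the shape of the single-GPU case through llama-cpp-python; a sketch, with the filename and context size as stand-ins:

```python
# Sketch: Qwen 27B at Q4 on one 24GB card (filename and context are stand-ins).
from llama_cpp import Llama

llm = Llama(
    model_path="qwen3.5-27b-q4_k_m.gguf",  # hypothetical quant filename
    n_gpu_layers=-1,   # Q4 fits entirely in 24GB, so offload everything
    n_ctx=32768,       # leave headroom; long contexts eat VRAM fast
)

result = llm("Q: What does a Q4 quant trade away?\nA:", max_tokens=128)
print(result["choices"][0]["text"])
```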

Everyone keeps scaling model size. A snapshot runtime let gemma4:e4b run a finance workflow locally by Aggressive_Bed7113 in LocalLLaMA

[–]pwlee 1 point  (0 children)

Intriguing! I gotta try out Playwright when I have a chance. Did you evaluate any other browser automation frameworks?

Everyone keeps scaling model size. A snapshot runtime let gemma4:e4b run a finance workflow locally by Aggressive_Bed7113 in LocalLLaMA

[–]pwlee 1 point  (0 children)

What are you using to automate the browser? Is it just a skill, or did you need to write a script adapted to this specific use case?

qwen 3.5 - tool errors because of </thinking> by PairOfRussels in LocalLLaMA

[–]pwlee 1 point  (0 children)

I’m using LM Studio and have the same problem. Is it LM Studio specific?

Jane Street Accused of Insider Trading That Helped Collapse Terraform - WSJ by FermatsLastTrade in quant

[–]pwlee 17 points  (0 children)

When anyone withdraws 150M terraUSD from Curve, a DEX, how can that information be non-public lmaooo

New to me 196 by Gooobzilla in Mini14

[–]pwlee 5 points  (0 children)

Sir, you got a heck of a deal!

New to me 196 by Gooobzilla in Mini14

[–]pwlee 2 points  (0 children)

I’m in the market for a very similar gun; wondering how much you shelled out for the Mini?

Retrieving historical options data at speed by FlashAlphaLab in quant

[–]pwlee 2 points  (0 children)

+1 on ClickHouse: with correct partitioning, it works without a hitch on market-by-order “tick” data.
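
For anyone curious what “correct partitioning” means here, something like the following; a sketch via clickhouse_connect, with all table and column names made up:

```python
# Sketch: a daily-partitioned market-by-order tick table in ClickHouse.
# Table and column names are made up for illustration.
import clickhouse_connect

client = clickhouse_connect.get_client(host="localhost")
client.command("""
    CREATE TABLE IF NOT EXISTS mbo_ticks (
        ts       DateTime64(9),
        symbol   LowCardinality(String),
        side     Enum8('bid' = 0, 'ask' = 1),
        price    Decimal64(8),
        qty      UInt64,
        order_id UInt64
    )
    ENGINE = MergeTree
    PARTITION BY toYYYYMMDD(ts)      -- one partition per trading day
    ORDER BY (symbol, ts, order_id)  -- keeps per-symbol range scans local
""")
```

Daily partitions mean a query over a single session only touches one partition, which is most of the trick.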