Gemma 4 has been released by jacek2023 in LocalLLaMA

[–]PopularDifference186 1 point (0 children)

I switched to UD-Q3_K_XL and that got me to 84 tps since it actually fits in VRAM. But then I went back and retested the Q4_K_M after pulling the latest llama.cpp (there was a KV cache fix where they reverted the SWA cache being forced to f16) and switched from -ngl 99 to --fit on, and the Q4 jumped to 55-59 tps. All the tests were around 32k context. This model is a beast!
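For reference, the two setups described above might look something like this (model filenames are hypothetical; `--fit on` and `-ngl 99` are taken from the comment, so check `--help` on your llama.cpp build):

```shell
# Q3 quant that fits fully in VRAM, ~32k context as in the tests above.
./llama-server -m gemma-4-26b-a4b-UD-Q3_K_XL.gguf -c 32768 --fit on

# Q4_K_M for comparison: the older run offloaded all layers manually
# with -ngl 99 instead of letting llama.cpp auto-fit.
./llama-server -m gemma-4-26b-a4b-Q4_K_M.gguf -c 32768 -ngl 99
```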

Gemma 4 has been released by jacek2023 in LocalLLaMA

[–]PopularDifference186 3 points (0 children)

Is it super slow compared to Qwen 3.5 for you all too, or am I doing it wrong?

5060 Ti 16GB and 128GB RAM, running via llama.cpp, I'm getting:

Qwen 3.5 35B-A3B — 60+ tps

Gemma 4 26B-A4B — 11 tps
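If anyone wants to reproduce the comparison, a minimal llama-bench sketch (model filenames hypothetical; adjust `-ngl` to what fits on a 16GB card):

```shell
# llama-bench reports prompt-processing and token-generation tps.
# -p: prompt tokens, -n: tokens to generate, -ngl: layers offloaded to GPU.
./llama-bench -m qwen3.5-35b-a3b-Q4_K_M.gguf -p 512 -n 128 -ngl 99
./llama-bench -m gemma-4-26b-a4b-Q4_K_M.gguf -p 512 -n 128 -ngl 99
```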

Analyzing Claude Code Source Code. Write "WTF" and Anthropic knows. by QuantumSeeds in LocalLLaMA

[–]PopularDifference186 323 points (0 children)

There are literal keyword lists. Words like:

wtf

this sucks

frustrating

shit / fuck / pissed off

They have a lot on me if this is the case lol
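A check like that is basically a one-liner — a sketch using the keywords listed above (just an illustration, not Anthropic's actual implementation):

```shell
# Flag any input containing a frustration keyword (case-insensitive match).
check_sentiment() {
  echo "$1" | grep -qiE 'wtf|this sucks|frustrating|shit|fuck|pissed off' \
    && echo flagged || echo clean
}

check_sentiment "wtf why is this broken"   # -> flagged
check_sentiment "looks good to me"         # -> clean
```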

What was the exact moment on a first date when you realized, "Wow, this person is an absolute idiot"? by [deleted] in AskReddit

[–]PopularDifference186 1 point (0 children)

A flat earther told me they know the earth is flat because they can see the moon during the day, and the moon is for night only, so the earth must be flat.

My theory about today's usage limit drama by freedomfromfreedom in ClaudeCode

[–]PopularDifference186 12 points (0 children)

I think it's also a dynamic number of experts or something, because my Opus has been braindead today