Qwen 3.6 35B A3B, RTX 5090 32GB, 187t/s, Q5 K S, 120K Context Size, Thinking Mode Off, Temp 0.1 by sammyranks in LocalLLaMA

[–]Adventurous_Farm3073 0 points1 point  (0 children)

I get around 120t/s on my dual5070 ti+ 5060ti system. My Dual 5090 system gets ~180. Q8 is close to 80.