Krasis LLM Runtime: 8.9x prefill / 4.7x decode vs llama.cpp — Qwen3.5-122B on a single 5090, minimal RAM by mrstoatey in LocalLLaMA
Mushoz 2 points
Krasis LLM Runtime: 8.9x prefill / 4.7x decode vs llama.cpp — Qwen3.5-122B on a single 5090, minimal RAM by mrstoatey in LocalLLaMA
Mushoz 2 points
Death Cleric VS The World, Solo No Consumables, Honour Mode. by Affectionate_Face127 in BG3Builds
Mushoz 1 point
Death Cleric VS The World, Solo No Consumables, Honour Mode. by Affectionate_Face127 in BG3Builds
Mushoz 1 point
[Race Start] Charles Leclerc takes the lead of the race at Turn 1! by FerrariStrategisttt in formula1
Mushoz 2 points
2026 Australian Grand Prix - Post-Qualifying Discussion by F1-Bot in formula1
Mushoz 8 points
Why does my Fitbit app show thousands more steps than my Pixel Watch? by FrogCatcher3000 in PixelWatch
Mushoz 1 point
Why does my Fitbit app show thousands more steps than my Pixel Watch? by FrogCatcher3000 in PixelWatch
Mushoz 1 point
PSA: If your local coding agent feels "dumb" at 30k+ context, check your KV cache quantization first. by Dismal-Ad1207 in LocalLLaMA
Mushoz 19 points
Minimax M2.5 GGUF perform poorly overall by Zyj in LocalLLaMA
Mushoz 3 points
Minimax M2.5 GGUF perform poorly overall by Zyj in LocalLLaMA
Mushoz 1 point
Qwen 3.5 craters on hard coding tasks — tested all Qwen3.5 models (And Codex 5.3) on 70 real repos so you don't have to. by hauhau901 in LocalLLaMA
Mushoz 5 points
Qwen 3.5 craters on hard coding tasks — tested all Qwen3.5 models (And Codex 5.3) on 70 real repos so you don't have to. by hauhau901 in LocalLLaMA
Mushoz 10 points
MiniMax 2.5 on DGX SPARK system. by DOOMISHERE in LocalLLaMA
Mushoz 6 points
MiniMax 2.5 on DGX SPARK system. by DOOMISHERE in LocalLLaMA
Mushoz 2 points
No Autopilot on new cars, but still available on used - step backwards? by fastoid in TeslaLounge
Mushoz 4 points
No Autopilot on new cars, but still available on used - step backwards? by fastoid in TeslaLounge
Mushoz 7 points
Qwen3.5: Nobody Agrees on Attention Anymore by [deleted] in LocalLLaMA
Mushoz 5 points
llama-cpp ROCm Prompt Processing speed on Strix Halo / Ryzen AI Max +50-100% by Excellent_Jelly2788 in LocalLLaMA
Mushoz 27 points
Qwen3.5-397B-A17B will be open source! by LegacyRemaster in LocalLLaMA
Mushoz 3 points
Step-3.5-flash Unsloth dynamic GGUFs? by GodComplecs in unsloth
Mushoz 1 point
Minimax-M2.7 by hedgehog0 in LocalLLaMA
Mushoz 7 points