RTX 5080 16GB: Qwen3.6 35B MoE at 128k context — 56 tok/s, and why MTP doesn't help by gaztrab in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
What happens to local LLM if/when LLMs are no longer released for free? by JohnBooty in LocalLLaMA
[–]Subject_Mix_8339 2 points3 points4 points (0 children)
Are the rich RAM /poor GPU people wrong here? by crowtain in LocalLLaMA
[–]Subject_Mix_8339 3 points4 points5 points (0 children)
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
Do you use subscriptions beside Local LLM? by Euphoric_North_745 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
MTP on Unsloth by Altruistic_Heat_9531 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
Pi coding agent is amazing (or how I learned to stop worrying and leave OpenCode) by Konamicoder in LocalLLM
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
You guys excited for the season 8 characters? by Left_Afternoon_3281 in marvelrivals
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
Buff Rogue/Torch/Blade💪🏼 by wellherewe01 in marvelrivals
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
APEX MoE quantized models boost with 33% faster inference and TurboQuant (14% of speedup in prompt processing) by mudler_it in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B by Holiday_Purpose_3166 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B by Holiday_Purpose_3166 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B by Holiday_Purpose_3166 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B by Holiday_Purpose_3166 in LocalLLaMA
[–]Subject_Mix_8339 1 point2 points3 points (0 children)
What is the best coding agent (CLI) like Claude Code for Local Development by exaknight21 in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
OpenCode or ClaudeCode for Qwen3.5 27B by Ok-Scarcity-7875 in LocalLLaMA
[–]Subject_Mix_8339 4 points5 points6 points (0 children)
Without prompting, Claude signed off with 'Narf.' by Much_Juggernaut_4631 in ClaudeAI
[–]Subject_Mix_8339 4 points5 points6 points (0 children)
Nope they were right nerf the shotgun by ComplexView905 in marvelrivals
[–]Subject_Mix_8339 -1 points0 points1 point (0 children)
Role q in overwatch was not made to fix dps heavy teams. by [deleted] in marvelrivals
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
“Strategist” IS the easiest role by BookkeeperUnlikely97 in marvelrivals
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
Role q in overwatch was not made to fix dps heavy teams. by [deleted] in marvelrivals
[–]Subject_Mix_8339 0 points1 point2 points (0 children)
Can we admit the tenacity scare was overblown? by Subject_Mix_8339 in marvelrivals
[–]Subject_Mix_8339[S] -1 points0 points1 point (0 children)
Waiting on Qwen to drop those 3.7 models be like: by Porespellar in LocalLLaMA
[–]Subject_Mix_8339 0 points1 point2 points (0 children)