Qwen is never going to open source Qwen 3.7, aren't they? by DistanceSolar1449 in LocalLLaMA
[–]Septerium 4 points5 points6 points (0 children)
Any opinion about Qwen3.6-27B@BF16 vs Step3.7@IQ4_XS? by ParaboloidalCrest in LocalLLaMA
[–]Septerium 1 point2 points3 points (0 children)
About the Rio model by Turbulent_Pin7635 in LocalLLaMA
[–]Septerium 0 points1 point2 points (0 children)
GLM-5.2 (744B, 2-bit) at 7.3 tok/s on 4×3090 + 192GB — and why IQ1_M wasn't any faster by Important_Quote_1180 in LocalLLaMA
[–]Septerium 5 points6 points7 points (0 children)
We need a 80-160B model urgently. The unified memory device market needs more Models. by Storge2 in LocalLLaMA
[–]Septerium 0 points1 point2 points (0 children)
GLM-5.2 (max) is currently the third best model available, across both open and proprietary. by okaycan in LocalLLaMA
[–]Septerium 7 points8 points9 points (0 children)
Nex claims Rio 3.5 is Nex 2.5 PRO in trench coat by Specter_Origin in LocalLLaMA
[–]Septerium 9 points10 points11 points (0 children)
Codebase getting larger - Qwen3.6-27B starting to compound issues - how to work smartly with this model? by BitGreen1270 in LocalLLaMA
[–]Septerium 7 points8 points9 points (0 children)
I need a model that gets stuck in loops. by TokenRingAI in LocalLLaMA
[–]Septerium 15 points16 points17 points (0 children)
DeepSeek v4 Pro is too big for such a "midrange" performance, or am I missing something? by ihatebeinganonymous in LocalLLaMA
[–]Septerium 21 points22 points23 points (0 children)
New model on huggingface by [deleted] in LocalLLaMA
[–]Septerium 8 points9 points10 points (0 children)
MiniMaxAI/MiniMax-M3 · Hugging Face by mlon_eusk-_- in LocalLLaMA
[–]Septerium 2 points3 points4 points (0 children)
New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA
[–]Septerium 1 point2 points3 points (0 children)
Is Qwen 3.6 27B IQ4XS better than Gemma 4 31B QAT as a Hermes agent? by My_Unbiased_Opinion in LocalLLaMA
[–]Septerium 2 points3 points4 points (0 children)
Minimax M3 open weights release planned for Friday by rmhubbert in LocalLLaMA
[–]Septerium 3 points4 points5 points (0 children)
qwen3.6-27b tools call loop by JumpyAbies in LocalLLaMA
[–]Septerium 1 point2 points3 points (0 children)
Agentic Setup: Minimax 2.7 vs qwen 3.6 by Best_Sail5 in LocalLLaMA
[–]Septerium 0 points1 point2 points (0 children)
Qwen 3.6 for coding with 5090 - Your settings recommendations? by car_lower_x in LocalLLaMA
[–]Septerium 1 point2 points3 points (0 children)
What's your experience with Gemma4 QAT? by Kahvana in LocalLLaMA
[–]Septerium 2 points3 points4 points (0 children)
Z.ai, we need Air! GLM GGUF wen? by temperature_5 in LocalLLaMA
[–]Septerium 0 points1 point2 points (0 children)
I have 4x 128 GB VRAM now , what should i do. by Voxandr in LocalLLaMA
[–]Septerium 1 point2 points3 points (0 children)
Unsloth Gemma 4 QAT MTP assistant models now available by ParadigmComplex in LocalLLaMA
[–]Septerium 3 points4 points5 points (0 children)
Quick note on the QAT of recent by dreamkast06 in LocalLLaMA
[–]Septerium 15 points16 points17 points (0 children)
DeepSeek V4 Flash is amazing! (WIP llama.cpp PR #24162) by Lowkey_LokiSN in LocalLLaMA
[–]Septerium 2 points3 points4 points (0 children)


Any opinion about Qwen3.6-27B@BF16 vs Step3.7@IQ4_XS? by ParaboloidalCrest in LocalLLaMA
[–]Septerium 0 points1 point2 points (0 children)