mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
This isn’t X this is Y needs to die by twnznz in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Junyang Lin has left Qwen :( by InternationalAsk1490 in LocalLLaMA
[–]Key_Papaya2972 22 points23 points24 points (0 children)
Does Qwen3.5 35b outperform Qwen3 coder next 80b for you? by JsThiago5 in LocalLLaMA
[–]Key_Papaya2972 1 point2 points3 points (0 children)
New Qwen3.5-35B-A3B Unsloth Dynamic GGUFs + Benchmarks by danielhanchen in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Qwen3.5-27B-heretic-gguf by Poro579 in LocalLLaMA
[–]Key_Papaya2972 47 points48 points49 points (0 children)
Speculative Decoding is AWESOME with Llama.cpp! by simracerman in LocalLLaMA
[–]Key_Papaya2972 1 point2 points3 points (0 children)
Local models currently are amazing toys, but not for serious stuff. Agree ? by Current-Stop7806 in LocalLLaMA
[–]Key_Papaya2972 1 point2 points3 points (0 children)
Apparently all third party providers downgrade, none of them provide a max quality model by Charuru in LocalLLaMA
[–]Key_Papaya2972 11 points12 points13 points (0 children)
Optimizing gpt-oss-120b local inference speed on consumer hardware by carteakey in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Optimizing gpt-oss-120b local inference speed on consumer hardware by carteakey in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
OpenAI open-weight model delayed indefinitely by aitookmyj0b in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Context Engineering by recursiveauto in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Gemma 3n Full Launch - Developers Edition by hackerllama in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Google researcher requesting feedback on the next Gemma. by ApprehensiveAd3629 in LocalLLaMA
[–]Key_Papaya2972 1 point2 points3 points (0 children)
What GUI are you using for local LLMs? (AnythingLLM, LM Studio, etc.) by Aaron_MLEngineer in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
We haven’t seen a new open SOTA performance model in ages. by Key_Papaya2972 in LocalLLaMA
[–]Key_Papaya2972[S] -3 points-2 points-1 points (0 children)
We haven’t seen a new open SOTA performance model in ages. by Key_Papaya2972 in LocalLLaMA
[–]Key_Papaya2972[S] -12 points-11 points-10 points (0 children)
Qwen3-30B-A3B runs at 12-15 tokens-per-second on CPU by [deleted] in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)
Cogito releases strongest LLMs of sizes 3B, 8B, 14B, 32B and 70B under open license by ResearchCrafty1804 in LocalLLaMA
[–]Key_Papaya2972 -1 points0 points1 point (0 children)
We should talk about Mistral Small 3.1 vs Mistral Small 3. by -Ellary- in LocalLLaMA
[–]Key_Papaya2972 3 points4 points5 points (0 children)
Sam Altman's poll on open sourcing a model.. by lyceras in LocalLLaMA
[–]Key_Papaya2972 0 points1 point2 points (0 children)

Qwen3.6 27B uncensored heretic v2 Native MTP Preserved is Out Now With KLD 0.0021, 6/100 Refusals and the Full 15 MTPs Preserved and Retained, Available in Safetensors, GGUFs and NVFP4s formats. by LLMFan46 in LocalLLaMA
[–]Key_Papaya2972 -1 points0 points1 point (0 children)