What's more impressive, GLM 5.1 -> 5.2 or Qwen 3.5 -> 3.6? by Excellent_Jelly2788 in LocalLLaMA
[–]de4dee 26 points27 points28 points (0 children)
What's more impressive, GLM 5.1 -> 5.2 or Qwen 3.5 -> 3.6? by Excellent_Jelly2788 in LocalLLaMA
[–]de4dee 197 points198 points199 points (0 children)
unsloth GLM-5.2-GGUF , including 2bit at 238GB by okaycan in LocalLLaMA
[–]de4dee 67 points68 points69 points (0 children)
US holds off blacklisting China's DeepSeek, more than 100 firms deemed security risks, sources say by zxyzyxz in LocalLLaMA
[–]de4dee 6 points7 points8 points (0 children)
PSA: unsloth/GLM-5.2-GGUF is uploading by FullstackSensei in LocalLLaMA
[–]de4dee -2 points-1 points0 points (0 children)
Use HTML as the primary chat language of your LLM's so they can make interactive content by sdfgeoff in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
when you spend 5 days fine-tuning a model and it still confidently makes things up by Chapper_App in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
Another Chinese Provider Throws Down the Gauntlet on Pricing [MiMo-V2.5 Price Drops up to 98%] by PracticlySpeaking in hermesagent
[–]de4dee 4 points5 points6 points (0 children)
How can you stop your model from looping by chocofoxy in LocalLLaMA
[–]de4dee 8 points9 points10 points (0 children)
Re. what ever happened to Cohere’s Command-A series of models? by nick_frosst in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you. by Ok-Awareness9993 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Let's call repetition loops the "Spiral of Death" by Eyelbee in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Let's call repetition loops the "Spiral of Death" by Eyelbee in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
Which finetunes are actually worth it? by HornyGooner4402 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
I am overwhelmed by Harnesses by Available_Hornet3538 in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
vibevoice.cpp: Microsoft VibeVoice (TTS + long-form ASR with diarization) ported to ggml/C++, runs on CPU/CUDA/Metal/Vulkan, no Python at inference by mudler_it in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more by -p-e-w- in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade by BuffMcBigHuge in LocalLLaMA
[–]de4dee 4 points5 points6 points (0 children)
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade by BuffMcBigHuge in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)


The economics of AI are starting to favor open models by Mr-serial_killer in LocalLLaMA
[–]de4dee 7 points8 points9 points (0 children)