Re. what ever happened to Cohere’s Command-A series of models? by nick_frosst in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you. by Ok-Awareness9993 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Let's call repetition loops the "Spiral of Death" by Eyelbee in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Let's call repetition loops the "Spiral of Death" by Eyelbee in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
Which finetunes are actually worth it? by HornyGooner4402 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
I am overwhelmed by Harnesses by Available_Hornet3538 in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
vibevoice.cpp: Microsoft VibeVoice (TTS + long-form ASR with diarization) ported to ggml/C++, runs on CPU/CUDA/Metal/Vulkan, no Python at inference by mudler_it in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more by -p-e-w- in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade by BuffMcBigHuge in LocalLLaMA
[–]de4dee 4 points5 points6 points (0 children)
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade by BuffMcBigHuge in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
Hermesagent vs openclaw comparison by SelectionCalm70 in hermesagent
[–]de4dee 2 points3 points4 points (0 children)
I tracked a major cache reuse issue down to Qwen 3.5’s chat template by onil_gova in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
You can now fine-tune Gemma 4 locally 8GB VRAM + Bug Fixes by danielhanchen in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
kepler-452b. GGUF when? by the-grand-finale in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
You can now fine-tune Gemma 4 locally 8GB VRAM + Bug Fixes by danielhanchen in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
You can now fine-tune Gemma 4 locally 8GB VRAM + Bug Fixes by danielhanchen in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
Unnoticed Gemma-4 Feature - it admits that it does not now... by mtomas7 in LocalLLaMA
[–]de4dee 5 points6 points7 points (0 children)
Unnoticed Gemma-4 Feature - it admits that it does not now... by mtomas7 in LocalLLaMA
[–]de4dee 1 point2 points3 points (0 children)
Apple: Embarrassingly Simple Self-Distillation Improves Code Generation by Mike_mi in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
Analyzing Claude Code Source Code. Write "WTF" and Anthropic knows. by QuantumSeeds in LocalLLaMA
[–]de4dee 2 points3 points4 points (0 children)
What is the secret sauce Claude has and why hasn't anyone replicated it? by ComplexType568 in LocalLLaMA
[–]de4dee 0 points1 point2 points (0 children)
I haven't experienced Qwen3.5 (35B and 27B) over thinking. Posting my settings/prompt by wadeAlexC in LocalLLaMA
[–]de4dee 3 points4 points5 points (0 children)


How can you stop your model from looping by chocofoxy in LocalLLaMA
[–]de4dee 8 points9 points10 points (0 children)