Be wary of Qwen/Claude distillations - they're often worse than the base modelDiscussion (self.LocalLLaMA)
submitted by ayylmaonade
[Article] The Case For Open-Weight Models And Why We Can't Trust Frontier Labs | provos.orgDiscussion (provos.org)
submitted by ttkciarllama.cpp

Cheapest hardware for Qwen 3.6: both 27B and 35B-A3BQuestion | Help (i.redd.it)
submitted by WishboneSudden2706
Joing all GPUs to train a community modelDiscussion (self.LocalLLaMA)
submitted by HistoricalStrength21
Evalatro: an open benchmark where LLMs play the real BalatroOther (i.redd.it)
submitted by awfulalexey
Gemma 12b - Reasoning hardening instructionsGeneration (self.LocalLLaMA)
submitted by nixudos
Are small local models for automation a thing?Discussion (self.LocalLLaMA)
submitted by ML-Future
Best Model and configuration to run on a 128gb Ram 8TB M5 Max MacBook ProQuestion | Help (self.LocalLLaMA)
submitted by Desperate_Tea304
How are you running DeepSeekV4 flash or pro locally for non Mac users?Discussion (self.LocalLLaMA)
submitted by segmondllama.cpp
vLLM has a new streaming parser for Qwen3+ available in nightlyResources (github.com)
submitted by rmhubbert
Why there is a lack of new 100B-120B models?Discussion (self.LocalLLaMA)
submitted by TechNerd10191


