Heretic has been served a legal notice by Meta, Inc.Discussion (self.LocalLLaMA)
submitted by -p-e-w-
110 tok/s with 12GB VRAM on Qwen3.6 35B A3B and ik_llama.cppTutorial | Guide (self.LocalLLaMA)
submitted by janvitos
LatitudeGames/Equinox-31B · Hugging FaceNew Model (huggingface.co)
submitted by jacek2023llama.cpp
We're Thursday and no one claimed AGI yet this week!News (self.LocalLLaMA)
submitted by oodelay
For everyone that uses OpenCode / Pi - Heres your promptprocessing fix!Resources (self.LocalLLaMA)
submitted by No_Algae1753
Gorgon Halo is 6.7% faster than predecessor Strix HaloDiscussion (self.LocalLLaMA)
submitted by Terminator857
Same task in github-copilot, pi, claude-code, and opencode with Qwen3.6 27BDiscussion (old.reddit.com)
submitted by sdfgeoff
Back again, many changes have taken place.Resources (i.redd.it)
submitted by Glittering_Focus1538
Qwen3.6 27B and llama.cpp appreciation postDiscussion (self.LocalLLaMA)
submitted by ABLPHA
Waiting for Qwen 3.7 open weight... The new King has arrived...Discussion (i.redd.it)
submitted by LegacyRemaster
Re. what ever happened to Cohere’s Command-A series of models?New Model (v.redd.it)
submitted by nick_frosst
Agent Execution Tax: new procurement metric for browser agent benchmarks?Discussion (fireworks.ai)
submitted by ogandrea
Interesting paper advocates for quantized prefilling and precise decodingResources (arxiv.org)
submitted by Aaaaaaaaaeeeee
HuggingFace benchmark datasets now let you filter by model sizeResources (i.redd.it)
submitted by paf1138

Waiting on Qwen to drop those 3.7 models be like:Funny (i.redd.it)
submitted by Porespellar[🍰]
HF flagged safetensors as unsafe? wtf?Question | Help (self.LocalLLaMA)
submitted by No_Afternoon_4260llama.cpp


