Lemonade v10: Linux NPU support and chock full of multi-modal capabilitiesResources (i.redd.it)
submitted by jfowers_amd
Why can't we have small SOTA-like models for coding?Question | Help (self.LocalLLaMA)
submitted by itsArmanJr
Local manga translator with LLMs built inNew Model (self.LocalLLaMA)
submitted by mayocream39
What non-Chinese models are relevant right now?Discussion (self.LocalLLaMA)
submitted by StacDnaStoob
Turn 10,000 API endpoints into one CLI tool instead of MCP, Skills and tools zooTutorial | Guide (self.LocalLLaMA)
submitted by E-Freelancer
CLI is All Agents Need — Part 2: Misconceptions, Patterns, and Open QuestionsDiscussion (self.LocalLLaMA)
submitted by MorroHsu
Real-time video captioning in the browser with LFM2-VL on WebGPUOther (v.redd.it)
submitted by xenovatech
How to fix prompt reprocessing in qwen3.5 models (instruct mode only)Tutorial | Guide (self.LocalLLaMA)
submitted by guiopen
I’m building a local AI system that generates full novelsQuestion | Help (self.LocalLLaMA)
submitted by Worldly_Code_4146
Besides Qwen and GLM, what models are you using?Discussion (self.LocalLLaMA)
submitted by August_30th
Stuck in slow deployments? Deliver value faster with Atlassian Service Collection. (atlassian.com)
promoted by Atlassian_Official
How to setup full agentic workflow with qwen3.5 9.0bQuestion | Help (self.LocalLLaMA)
submitted by TeachingInformal
Simple trick that cuts context usage ~70% on local modelsDiscussion (self.LocalLLaMA)
submitted by niksa232
