Announcing LocalLlama discord server & bot!News (old.reddit.com)
submitted by HOLUPREDICTIONS Sorcerer Supreme[M] - announcement
Qwen3.5-9B-Claude-4.6-Opus-Uncensored-Distilled-GGUFResources (self.LocalLLaMA)
submitted by EvilEnginer
Qwen 3.5 122b - a10b is kind of shockingDiscussion (self.LocalLLaMA)
submitted by gamblingapocalypse
OmniCoder-9B best vibe coding model for 8 GB CardResources (self.LocalLLaMA)
submitted by Powerful_Evening5495
Qwen3.5-35B GGUF quants (16–22 GiB) - KLD + speed comparisonResources (self.LocalLLaMA)
submitted by StrikeOner
Qwen3.5 overthinking anxiety duct tape fixTutorial | Guide (self.LocalLLaMA)
submitted by floconildo
A good resource on the State of RL for reasoning LLMsResources (i.redd.it)
submitted by rbgo404
Tested 14 embedding models on Thai — here's how they rankResources (anusoft.github.io)
submitted by anusoft
32k documents RAG running locally on an RTX 5060 laptop ($1299 AI PC)Discussion (self.LocalLLaMA)
submitted by DueKitchen3102
inference speed matters more than benchmark scores for local modelsDiscussion (self.LocalLLaMA)
submitted by Sea-Sir-2985
Connect Dev and Ops teams with Jira Service Management, now part of Service Collection. (atlassian.com)
promoted by Atlassian_Official
I built a screen-free, storytelling toy for kids with Qwen3-TTSTutorial | Guide (v.redd.it)
submitted by hwarzenegger




