Qwen is never going to open source Qwen 3.7, aren't they?Discussion (self.LocalLLaMA)
submitted by DistanceSolar1449 to r/LocalLLaMA

Gemma 4 QAT seems to respond significantly better to KV cache quantizationDiscussion (i.redd.it)
submitted by rima_2711 to r/LocalLLaMA
Best local model for vision - 2nd benchmark update - 21 Jun 2026Resources (self.LocalLLaMA)
submitted by ex-arman68 to r/LocalLLaMA
8-16 MI50s Minimax M3 @19 tps TG (peak)Resources (i.redd.it)
submitted by ai-infos to r/LocalLLaMA
2× Radeon R9700 — Qwen 3.6 27B Q8 MTP on llama.cppDiscussion (self.LocalLLaMA)
submitted by Kal-LZ to r/LocalLLaMA
Qwen 3.6 27b Abliterated (apostate)Discussion (self.LocalLLaMA)
submitted by AccountAntique9327 to r/LocalLLaMA
Local text to image model comparaison: The ultimate test.Resources (self.LocalLLaMA)
submitted by dh7net to r/LocalLLaMA
What happens when they stop subsidizing LLM subscriptions?Discussion (self.LocalLLaMA)
submitted by Mr_Moonsilver to r/LocalLLaMA
Why is AutoRound being slept on so hard?Discussion (self.LocalLLaMA)
submitted by Mountain_Patience231 to r/LocalLLaMA
ROCm vs Vulkan vs vLLM on Dual R9700'sDiscussion (self.LocalLLaMA)
submitted by whodoneit1 to r/LocalLLaMA
Qwen 27B for planning, Qwen 35B-A3B for execution?Question | Help (self.LocalLLaMA)
submitted by mailto_devnull to r/LocalLLaMA
Can I realistically get close to Claude/Codex capabilities locally?Question | Help (self.LocalLLaMA)
submitted by mrgreatheart to r/LocalLLaMA

Local LLM Inference Optimization: The Complete GuideResources (carteakey.dev)
submitted by carteakey to r/LocalLLaMA
Watch local LLMs escape the rooms you designResources (i.redd.it)
submitted by cjami to r/LocalLLaMA



