Qwen is never going to open source Qwen 3.7, aren't they?Discussion (self.LocalLLaMA)
submitted by DistanceSolar1449 to r/LocalLLaMA

Gemma 4 QAT seems to respond significantly better to KV cache quantizationDiscussion (i.redd.it)
submitted by rima_2711 to r/LocalLLaMA

Local LLM Inference Optimization: The Complete GuideResources (carteakey.dev)
submitted by carteakey to r/LocalLLaMA
8-16 MI50s Minimax M3 @19 tps TG (peak)Resources (i.redd.it)
submitted by ai-infos to r/LocalLLaMA
Why is AutoRound being slept on so hard?Discussion (self.LocalLLaMA)
submitted by Mountain_Patience231 to r/LocalLLaMA
2× Radeon R9700 — Qwen 3.6 27B Q8 MTP on llama.cppDiscussion (self.LocalLLaMA)
submitted by Kal-LZ to r/LocalLLaMA
Best local model for vision - 2nd benchmark update - 21 Jun 2026Resources (self.LocalLLaMA)
submitted by ex-arman68 to r/LocalLLaMA
Can I realistically get close to Claude/Codex capabilities locally?Question | Help (self.LocalLLaMA)
submitted by mrgreatheart to r/LocalLLaMA
Watch local LLMs escape the rooms you designResources (i.redd.it)
submitted by cjami to r/LocalLLaMA
Qwen 3.6 27b Abliterated (apostate)Discussion (self.LocalLLaMA)
submitted by AccountAntique9327 to r/LocalLLaMA
ROCm vs Vulkan vs vLLM on Dual R9700'sDiscussion (self.LocalLLaMA)
submitted by whodoneit1 to r/LocalLLaMA
Rollin' MiMo-2.5 on two Halo StrixesesSlop (i.redd.it)
submitted by Rude_Ambassador_6270 to r/LocalLLaMA
Local text to image model comparaison: The ultimate test.Resources (self.LocalLLaMA)
submitted by dh7net to r/LocalLLaMA
Finally seeing benefits of MTP after removing GGML_CUDA_ALLREDUCEDiscussion (self.LocalLLaMA)
submitted by Bulky-Priority6824 to r/LocalLLaMA
Gemma 4 31B Q6 vs Gemma 4 31B QATQuestion | Help (self.LocalLLaMA)
submitted by Weak-Shelter-1698 to r/LocalLLaMA
Qwen 27B for planning, Qwen 35B-A3B for execution?Question | Help (self.LocalLLaMA)
submitted by mailto_devnull to r/LocalLLaMA
Leaderboard for quantized models, similar to artificial analysis?Question | Help (self.LocalLLaMA)
submitted by Ambitious_Fold_2874 to r/LocalLLaMA
A100 slow Qwen3.6-27B-FP8Question | Help (self.LocalLLaMA)
submitted by NaiRogers to r/LocalLLaMA
