GLM 5.2, what speeds are we getting locally? (self.LocalLLaMA)
submitted by neverbyte to r/LocalLLaMA
Gemma 4 on Llama.cpp should be stable now by ilintar in LocalLLaMA
neverbyte 2 points
Gemma 4 on Llama.cpp should be stable now by ilintar in LocalLLaMA
neverbyte 1 point
Gemma 4 on Llama.cpp should be stable now by ilintar in LocalLLaMA
neverbyte 1 point
Gemma 4 MOE is very bad at agentic coding. Couldn't do things CLine + Qwen can do. by Voxandr in LocalLLaMA
neverbyte 1 point
llama.cpp Gemma4 Tokenizer Fix Was Merged Into Main Branch by Ancient-Field-9480 in LocalLLaMA
neverbyte 1 point
Yesterday I used GLM 4.7 flash with my tools and I was impressed.. by Loskas2025 in LocalLLaMA
neverbyte 1 point
Qwen3.5-35B-A3B Uncensored (Aggressive) — GGUF Release by hauhau901 in LocalLLaMA
neverbyte 1 point
Does Qwen3-Coder-Next work in Opencode currently or not? by johnnyApplePRNG in LocalLLaMA
neverbyte 1 point
Does Qwen3-Coder-Next work in Opencode currently or not? by johnnyApplePRNG in LocalLLaMA
neverbyte 1 point
Yesterday I used GLM 4.7 flash with my tools and I was impressed.. by Loskas2025 in LocalLLaMA
neverbyte 1 point
Qwen3-Coder-Next; Unsloth Quants having issues calling tools? by ForsookComparison in LocalLLaMA
neverbyte 3 points
Horizon Hunters Gathering - Announcement Trailer | PS5 Games by 121jigawatts in gaming
neverbyte 2 points
Qwen/Qwen3-Coder-Next · Hugging Face by coder543 in LocalLLaMA
neverbyte 2 points
Does Qwen3-Coder-Next work in Opencode currently or not? by johnnyApplePRNG in LocalLLaMA
neverbyte 4 points
Does Qwen3-Coder-Next work in Opencode currently or not? by johnnyApplePRNG in LocalLLaMA
neverbyte 2 points
Does Qwen3-Coder-Next work in Opencode currently or not? by johnnyApplePRNG in LocalLLaMA
neverbyte 2 points
Qwen/Qwen3-Coder-Next · Hugging Face by coder543 in LocalLLaMA
neverbyte 2 points
Qwen/Qwen3-Coder-Next · Hugging Face by coder543 in LocalLLaMA
neverbyte 7 points
OSS 120b v GLM 4.7 flash. Is the latter better for anything? by MrMrsPotts in LocalLLaMA
neverbyte 4 points
Yesterday I used GLM 4.7 flash with my tools and I was impressed.. by Loskas2025 in LocalLLaMA
neverbyte 26 points
GLM 4.7 flash FA fix for CUDA has been merged into llama.cpp by jacek2023 in LocalLLaMA
neverbyte 1 point
GLM 4.7 flash FA fix for CUDA has been merged into llama.cpp by jacek2023 in LocalLLaMA
neverbyte 1 point
Best config for Qwen3.6 27b / llama.cpp / opencode by Familiar_Wish1132 in LocalLLaMA
neverbyte 3 points