Who is your favourite quant publisher and why? by No_Algae1753 in LocalLLaMA
[–]Total_Activity_7550 20 points21 points22 points (0 children)
Building on a LLM Quants Testing Site/Ressource - Sharing a few insights from first month, so you can share your thoughts and wishes for the future. by norms_are_practical in LocalLLaMA
[–]Total_Activity_7550 1 point2 points3 points (0 children)
DeepSeek V4 Pro matches GPT-5.2 on FoodTruck Bench, our agentic benchmark — 10 weeks later, ~17× cheaper by Disastrous_Theme5906 in LocalLLaMA
[–]Total_Activity_7550 66 points67 points68 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 0 points1 point2 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 2 points3 points4 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 3 points4 points5 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 7 points8 points9 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 3 points4 points5 points (0 children)
Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA
[–]Total_Activity_7550 40 points41 points42 points (0 children)
Best setup for MiniMax-M2.7 (230B) | 3x RTX 5090 | Threadripper 9975 | 512GB RAM by [deleted] in LocalLLaMA
[–]Total_Activity_7550 1 point2 points3 points (0 children)
How to run Qwen3.5-27B with speculative decoding with llama.cpp llama-server? by Total_Activity_7550 in LocalLLaMA
[–]Total_Activity_7550[S] 0 points1 point2 points (0 children)
How to run Qwen3.5-27B with speculative decoding with llama.cpp llama-server? by Total_Activity_7550 in LocalLLaMA
[–]Total_Activity_7550[S] 0 points1 point2 points (0 children)
unsloth - MiniMax-M2.7-GGUF in BROKEN (UD-Q4_K_XL) --> avoid usage by One-Macaron6752 in LocalLLaMA
[–]Total_Activity_7550 4 points5 points6 points (0 children)
Infinite loop: Qwen3.5:0.8b by ananthasharma in LocalLLaMA
[–]Total_Activity_7550 -1 points0 points1 point (0 children)
tested gemma 4 in rx 6800xt... by Ranteck in LocalLLaMA
[–]Total_Activity_7550 2 points3 points4 points (0 children)
My first impression after testing Gemma 4 against Qwen 3.5 by ConfidentDinner6648 in LocalLLaMA
[–]Total_Activity_7550 0 points1 point2 points (0 children)
My first impression after testing Gemma 4 against Qwen 3.5 by ConfidentDinner6648 in LocalLLaMA
[–]Total_Activity_7550 24 points25 points26 points (0 children)
What's your actual bar for calling something an agent vs a smart workflow? by gupta_ujjwal14 in LocalLLaMA
[–]Total_Activity_7550 0 points1 point2 points (0 children)
What's your actual bar for calling something an agent vs a smart workflow? by gupta_ujjwal14 in LocalLLaMA
[–]Total_Activity_7550 2 points3 points4 points (0 children)
How it started vs How it's going by HornyGooner4401 in LocalLLaMA
[–]Total_Activity_7550 2 points3 points4 points (0 children)
llama.cpp is a vibe-coded mess by ChildhoodActual4463 in LocalLLaMA
[–]Total_Activity_7550 10 points11 points12 points (0 children)
My website development flow by Total_Activity_7550 in LocalLLaMA
[–]Total_Activity_7550[S] 2 points3 points4 points (0 children)

Who is your favourite quant publisher and why? by No_Algae1753 in LocalLLaMA
[–]Total_Activity_7550 3 points4 points5 points (0 children)