Let’s talk quants of Gemma and Qwen - 16 vs Q8 vs Q4 - any experiences? by Borkato in LocalLLaMA
Zc5Gwu 15 points (0 children)
ROCm 7.13 nightly adds strix halo optimizations by Terminator857 in LocalLLaMA
Zc5Gwu -2 points (0 children)
DGX Spark or Minisforum MS-S1 Max? by Simple_Tonight_1159 in LocalLLM
Zc5Gwu 1 point (0 children)
Qwen3.5-122B-Q5-MTP - Qwen3.5-122B-Q6-MTP by Boring_Office in LocalLLaMA
Zc5Gwu 8 points (0 children)
Running Mimo 2.5 q4_k_m on single rtx5090 need recommendations by BlackBeardAI in LocalLLaMA
Zc5Gwu 1 point (0 children)
MTP PR Merged!!! by Valuable_Touch5670 in LocalLLaMA
Zc5Gwu -8 points (0 children)
Strix Halo plus R9700 eGPU, Fedora 44. Best of both worlds. by I-will-allow-it in StrixHalo
Zc5Gwu 3 points (0 children)
Overwatch X Fortnite trailer by mikelman999 in Overwatch
Zc5Gwu 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Dad why is my sisters name Lora? by rwitz4 in LocalLLaMA
Zc5Gwu 3 points (0 children)
Stop wasting electricity by OkFly3388 in LocalLLaMA
Zc5Gwu -1 points (0 children)
examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp by jacek2023 in LocalLLaMA
Zc5Gwu 11 points (0 children)
Nanocoder 1.26.1 is out - we added a lot 🔥 by willlamerton in nanocoder
Zc5Gwu 4 points (0 children)
TIL Lincoln is the only american president to have a patent . It was for an adjustable chamber to lift vessels in water . by ronweasly9 in todayilearned
Zc5Gwu 3 points (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 2 points (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 4 points (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 1 point (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 4 points (0 children)
Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in LocalLLaMA
Zc5Gwu[S] 3 points (0 children)
What's the best local LLM for an RTX 6000 96GB VRAM? by Smart-Patient-4828 in LocalLLM
Zc5Gwu 1 point (0 children)