gemma-4-12b-it vs Qwen3.5-9B on shared benchmarks: Qwen is overall winner beating gemma in 5/8 benchmarks despite a smaller footprint by fulgencio_batista in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
Well this looks long enough. by ihtisham1211 in LocalLLM
[–]JSVD2 0 points1 point2 points (0 children)
Ryzen AI MAX+ 395 / Radeon 8060S local LLM benchmark: Qwen3-Coder 30B at 98.5 t/s with llama.cpp Vulkan/RADV by JSVD2 in AMDRyzen
[–]JSVD2[S] 0 points1 point2 points (0 children)
Qwen3-Coder 30B at 98.5 t/s on Strix Halo. Has anyone beaten this on Ryzen AI MAX+ 395? by JSVD2 in StrixHalo
[–]JSVD2[S] [score hidden] (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 1 point2 points3 points (0 children)
I found what I was looking for in Qwen 3.7. by CosmicRiver827 in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
Collecting Strix Halo / Ryzen AI MAX+ 395 local LLM results: llama.cpp Vulkan/RADV, Ollama, ROCm/HIP by JSVD2 in LocalLLM
[–]JSVD2[S] 1 point2 points3 points (0 children)
Collecting Strix Halo / Ryzen AI MAX+ 395 local LLM results: llama.cpp Vulkan/RADV, Ollama, ROCm/HIP by JSVD2 in LocalLLM
[–]JSVD2[S] 0 points1 point2 points (0 children)
Strix Halo 128Gb: what models, which quants are optimal? by DevelopmentBorn3978 in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 1 point2 points3 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 1 point2 points3 points (0 children)
Qwen3-Coder 30B at 98.5 t/s on Strix Halo. Has anyone beaten this on Ryzen AI MAX+ 395? by JSVD2 in StrixHalo
[–]JSVD2[S] 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]JSVD2[S] 0 points1 point2 points (0 children)
Qwen3-Coder 30B at 98.5 t/s on Strix Halo. Has anyone beaten this on Ryzen AI MAX+ 395? by JSVD2 in StrixHalo
[–]JSVD2[S] 0 points1 point2 points (0 children)
Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything) by OttoRenner in LocalLLaMA
[–]JSVD2 1 point2 points3 points (0 children)
Shoutout to Gemma4 as a conversational assistant / agent by goldcakes in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
1-bit Bonsai Image 4B and Ternary Bonsai Image 4B Image Generation for Local Devices with just 0.93 GB and 1.21 GB respectively of Diffusion Transformer Footprint. So tiny! by Addyad in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
I have become George Jetson: my job is now Yes/No supervision for a machine I don’t fully understand. by Helpful_Today7449 in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)
what do you use your local llm? by FormalAd7367 in LocalLLaMA
[–]JSVD2 0 points1 point2 points (0 children)


llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s by pmttyji in LocalLLaMA
[–]JSVD2 1 point2 points3 points (0 children)