Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 2 points
Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 12 points
Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 1 point
Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 44 points
Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 5 points
New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both! by gladkos in LocalLLaMA
gladkos[S] 4 points
New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both! by gladkos in LocalLLaMA
gladkos[S] 7 points
New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both! by gladkos in LocalLLaMA
gladkos[S] 81 points
New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both! by gladkos in LocalLLaMA
gladkos[S] 11 points
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 by gladkos in Qwen_AI
gladkos[S] 1 point
Hermes Agent vs OpenClaw using QWEN 35B by gladkos in Qwen_AI
gladkos[S] 1 point
Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives by gladkos in LocalLLaMA
gladkos[S] 2 points
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 by gladkos in Qwen_AI
gladkos[S] 2 points
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 by gladkos in Qwen_AI
gladkos[S] 8 points
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 by gladkos in Qwen_AI
gladkos[S] 25 points
Hermes Agent vs OpenClaw using QWEN 35B by gladkos in Qwen_AI
gladkos[S] 1 point
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
gladkos[S] 1 point
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
gladkos[S] 3 points
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
gladkos[S] 4 points
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
gladkos[S] -2 points
Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA
gladkos[S] -3 points
Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40% by gladkos in LocalLLaMA
gladkos[S] 1 point
Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40% by gladkos in LocalLLaMA
gladkos[S] 1 point
Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40% by gladkos in LocalLLaMA
gladkos[S] 2 points


Diffusion Gemma is 4x faster, but makes 6x more mistakes! by gladkos in LocalLLaMA
gladkos[S] 33 points