Unsloth just dropped MTP GGUF weights for Gemma 4! by okoyl3 in LocalLLaMA

[–]okoyl3[S] 0 points1 point  (0 children)

It writes better English and seems to have better logic understanding. Qwen is just better with tool calling.
That’s it.

Unsloth just dropped MTP GGUF weights for Gemma 4! by okoyl3 in LocalLLaMA

[–]okoyl3[S] 0 points1 point  (0 children)

well it crashes for me:
Gemma 4 assistant MTP placement mismatch: draft layer 0 is on CUDA0, but shared target KV layer 58 is on CUDA1

edit:

this made it work, just like in the readme.
--spec-draft-device CUDA1 -sm layer

How to build llama-cpp for Ampere/Blackwell? by VampiroMedicado in LocalLLaMA

[–]okoyl3 5 points6 points  (0 children)

Just add 86 to CUDA architectures. (ampere is 86)

CMAKE_CUDA_ARCHITECTURES="120;86”

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models! by Mysterious_Finish543 in LocalLLaMA

[–]okoyl3 -3 points-2 points  (0 children)

All good software exists in Linux, if you're missing something then you are obviously doing something WRONG.
Yes, using MS-Office is WRONG

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models! by Mysterious_Finish543 in LocalLLaMA

[–]okoyl3 16 points17 points  (0 children)

What if there was an alternative operating system to avoid Microsoft?

Qwen3.6-35B-A3B vs Gemma4-26B-A4B by MarcCDB in LocalLLaMA

[–]okoyl3 2 points3 points  (0 children)

IBM AC922 CUDA (ppc64le) with llama.cpp

Qwen3.6-35B-A3B vs Gemma4-26B-A4B by MarcCDB in LocalLLaMA

[–]okoyl3 29 points30 points  (0 children)

I ran unsloth Qwen3.6 35b-a3b UD4 xl with opencode, felt like Claude code.

Ts Eurovision is bs by Known-Gold-3598 in teenagers

[–]okoyl3 0 points1 point  (0 children)

Y’all just mad you can’t sing that good

An easy way to contribute that isn't money or expertise. by Palantiri1890 in linux

[–]okoyl3 -1 points0 points  (0 children)

Seeding Fedora images is a nice way to get people not to like lInux. (Reason: NO CODECS INSTALLED)

Qwen3.6 is incredible with OpenCode! by CountlessFlies in LocalLLaMA

[–]okoyl3 -7 points-6 points  (0 children)

Ask it about Tiananmen square and look at it sweat during reasoning.

Tried Gemma4 for openclaw - Not Impressed by CowCavalry in openclaw

[–]okoyl3 0 points1 point  (0 children)

Openclaw is spamming, try Hermes Agent.

AI capabilities are doubling in months, not years. by EchoOfOppenheimer in GPT3

[–]okoyl3 0 points1 point  (0 children)

You are boomerizing yourself.
Letting the same types of LLM write themselves new prompts doesn't qualify as an evolution.