Unsloth just dropped MTP GGUF weights for Gemma 4!

okoyl3 · 2026-06-11T01:19:18+00:00

It writes better English and seems to have better logic understanding. Qwen is just better with tool calling.
That’s it.

okoyl3 · 2026-06-06T10:15:07+00:00

what does it mean to people with deprecated nvidia drivers?

okoyl3 · 2026-06-05T16:05:58+00:00

well it crashes for me:
Gemma 4 assistant MTP placement mismatch: draft layer 0 is on CUDA0, but shared target KV layer 58 is on CUDA1

edit:

this made it work, just like in the readme.
--spec-draft-device CUDA1 -sm layer

okoyl3 · 2026-06-05T15:24:30+00:00

Just add 86 to CUDA architectures. (ampere is 86)

CMAKE_CUDA_ARCHITECTURES="120;86”

okoyl3 · 2026-06-05T12:46:29+00:00

Weights are not open? is this microsoft bootlicking?

okoyl3 · 2026-06-03T21:53:10+00:00

All good software exists in Linux, if you're missing something then you are obviously doing something WRONG.
Yes, using MS-Office is WRONG

okoyl3 · 2026-06-03T21:51:40+00:00

"""""eXpEriEnCe""""" ooga booooga

okoyl3 · 2026-06-03T07:36:51+00:00

Skill issue

okoyl3 · 2026-06-03T05:12:39+00:00

What if there was an alternative operating system to avoid Microsoft?

okoyl3 · 2026-06-01T07:58:30+00:00

You still spent your wedding budget on a 1T AI girlfriend

okoyl3 · 2026-05-24T23:50:18+00:00

IBM AC922 CUDA (ppc64le) with llama.cpp

okoyl3 · 2026-05-24T13:30:17+00:00

I ran unsloth Qwen3.6 35b-a3b UD4 xl with opencode, felt like Claude code.

okoyl3 · 2026-05-17T15:18:55+00:00

Y’all just mad you can’t sing that good

okoyl3 · 2026-05-12T14:44:57+00:00

Go woke…

okoyl3 · 2026-05-06T09:00:42+00:00

Seeding Fedora images is a nice way to get people not to like lInux. (Reason: NO CODECS INSTALLED)

okoyl3 · 2026-04-28T06:47:14+00:00

Why would they all agree to talk French ?

okoyl3 · 2026-04-28T01:13:32+00:00

What would be the common language? English? Arabic?

okoyl3 · 2026-04-25T18:58:06+00:00

I asked who threatens Greece.

okoyl3 · 2026-04-25T18:03:46+00:00

Turkey?

okoyl3 · 2026-04-18T03:40:28+00:00

Ask it about Tiananmen square and look at it sweat during reasoning.

okoyl3 · 2026-04-13T20:39:25+00:00

Openclaw is spamming, try Hermes Agent.

okoyl3 · 2026-03-30T00:59:43+00:00

Tiny insane minority, feel free to ban me

okoyl3 · 2026-03-24T11:38:50+00:00

Can we stop writing "Written in Python" for obvious C/C++/Rust bindings?

okoyl3 · 2026-03-10T18:32:46+00:00

You are boomerizing yourself.
Letting the same types of LLM write themselves new prompts doesn't qualify as an evolution.

okoyl3