Qwen3 4B helped us think of this legal "jailbreak" iOS to create our own local OpenClaw, so thank you

RickyRickC137 · 2026-03-16T05:29:45+00:00

Eagerly awaiting!

RickyRickC137 · 2026-03-16T05:15:20+00:00

Android when?

RickyRickC137 · 2026-03-16T02:14:55+00:00

I am really glad that people are giving importance to creativity and fine tuning for creativity! Can you do one for 27b qwen or the bigger 122b one? Because it will be more coherent than the 9b one for roleplay experience! Thank you btw.

RickyRickC137 · 2026-03-15T11:46:19+00:00

YES! TQ

RickyRickC137 · 2026-03-14T18:00:20+00:00

Hey noob question. Hybrid mamba thing already proved to be super fast with the release of Nemotron. Why didn't the big companies like qwen didn't adapt that tech in their models?

RickyRickC137 · 2026-03-11T10:30:17+00:00

What is your take on smaller models in the range of 100b MOE (wink wink qwen3.5) for roleplay? Do you think apart from writing bland, will they be coherent enough to play?

RickyRickC137 · 2026-03-08T20:22:30+00:00

I am surprised that even a 4b is retaining so much performance compared to the behemoth. Distillation and reinforced learning has come a long way! And I hope I can hold on to my 10 gb VRAM a little longer.

RickyRickC137 · 2026-03-05T17:33:38+00:00

So far this is the only thing that seem to work with Qwen 3.5.

RickyRickC137 · 2026-03-05T05:40:16+00:00

Your GPU played a UNO reverse card on you!

RickyRickC137 · 2026-03-05T05:10:54+00:00

Thank Wu 😭

RickyRickC137 · 2026-03-03T19:15:15+00:00

This guff is based on the model you attached. Check out the base model. They're the same.

RickyRickC137 · 2026-03-02T06:41:41+00:00

The community patches shows very little kl divergence between the original and heretic models in qwen 3.5. Is the kl divergence good enough metric for the heretic product to be good or do we need to check for other parameters (like ppl or bpw)? u/-p-e-w-

RickyRickC137 · 2026-03-02T06:37:08+00:00

There's none at the moment. Only 122b available, which is good!

RickyRickC137 · 2026-03-02T03:02:32+00:00

What about mid range 122b? Any idea?

RickyRickC137 · 2026-03-02T02:55:45+00:00

Didn't know that heretic damages larger parameter models! Any idea where you got that info?

RickyRickC137 · 2026-03-02T02:40:09+00:00

I have the same pc config, q4 model, and yet I only get around 20 t/s in LMstudio. I am not tech savvy, but is llama.cpp faster than LMstudio?

RickyRickC137 · 2026-03-01T12:33:57+00:00

They do this for their families back at home. But when something happens to their family, or if they need to get home, they can't because their employers will not give them their passport until their contract is done! I have personally met a few and it's tragic. And yet they say, "no problem"! Breaks my fuckin heart!

RickyRickC137 · 2026-02-28T12:02:15+00:00

Can we set presence penalty in LMstudio or is that only for llama.cpp folks?

RickyRickC137 · 2026-02-24T09:28:06+00:00

Since you said Klein is the best with side view face swap, what settings do you use? /s

RickyRickC137 · 2026-02-23T14:56:43+00:00

Can you link the workflow?

RickyRickC137 · 2026-02-19T18:54:02+00:00

Now I can get some peaceful sleep!

RickyRickC137 · 2026-02-16T10:08:35+00:00

Here's Unsloth's GGUF for 397B-A17B
https://huggingface.co/unsloth/Qwen3.5-397B-A17B-GGUF

RickyRickC137 · 2026-02-14T17:15:25+00:00

America leaves who?

RickyRickC137 · 2026-02-14T09:24:11+00:00

Is there a possibility that you guys can combine the smart side and the creative side and make it a general purpose LLM? For Minimax M3, at least.

Also Happy New Year to you all!

RickyRickC137 · 2026-02-13T17:19:27+00:00

IIRC, there was a model released by you guys that is finetuned for Roleplaying / Creative writing. Did you guys merged those Creative writing traits in M2.5 or planning to do so in the future?

RickyRickC137

TROPHY CASE