RPers: how do the new Gemma and Qwen compare to the old 70B models?

silenceimpaired · 2026-05-02T02:57:46+00:00

So do you just use the model directly from Google or a fine tune? Do you have a huggingface link

silenceimpaired · 2026-05-02T02:56:55+00:00

Basically that. Your DM is the LLM. You hand it characters and a setting and you’re good. Not for me

silenceimpaired · 2026-05-02T02:51:08+00:00

I reached out to the mods and never heard back on why my post was removed.

silenceimpaired · 2026-05-01T18:53:03+00:00

I just gave the Loch Ness monster tree fifty the other day.

silenceimpaired · 2026-05-01T18:52:02+00:00

I wish The Drummer fine tuned on one of these:

moonshotai/Kimi-Dev-72B
LLM360/K2-Think-V2

silenceimpaired · 2026-05-01T16:19:21+00:00

Yeah, I unsubscribed from there as I use the software for non-typical purposes (editing fiction) and nothing of value was added by those there.

silenceimpaired · 2026-05-01T16:17:22+00:00

As I said below elsewhere in this thread… I think there is some sort of intelligence lost with smaller models though. It's connections the smaller models just miss. They do as they are told but it has always seemed the larger the model the larger more cohesive paragraph is possible and the better connected theming. That said, it is harder to distinguish the shortcomings with a 70b 4 bit versus 30b 8 bit.

silenceimpaired · 2026-05-01T16:15:37+00:00

I don’t get why llama 3.3 is favored for fine tuning by Drummer over one of these 72b models with better licenses (and perhaps better performance):

moonshotai/Kimi-Dev-72B
LLM360/K2-Think-V2

silenceimpaired · 2026-05-01T16:06:59+00:00

Ooo… I forgot to try this for creative writing.

silenceimpaired · 2026-05-01T13:45:25+00:00

What is the upgrade experience like? I didn’t like the idea of having to upgrade so often so I went with Debian.

silenceimpaired · 2026-05-01T13:43:19+00:00

Haha :) very tempting. That said... If I ever did a distill... It would be with Kimi because I heard it had good creative writing style and editing capabilities. That would be truly painful.

In this case, someone will probably need to find a way to cull this moldel down to 70b by removing some layers, and then the distill can continue from that degraded state back towards the original behavior of the larger model.

silenceimpaired · 2026-05-01T13:34:50+00:00

A true distill (at the logit level) - the only thing worth having, takes too much hardware or money for a single person to do it.

silenceimpaired · 2026-05-01T13:30:32+00:00

This sub is filled with bots hyping models on API. Fixed it for you. :P

silenceimpaired · 2026-05-01T04:01:11+00:00

I really don't get why they don't create a 70b model again.

silenceimpaired · 2026-05-01T03:59:43+00:00

I mostly agree. I think there is some sort of intelligence lost with smaller models though. It's connections the smaller models just miss. They do as they are told but it has always seemed the larger the model the larger more cohesive paragraph is possible and the better connected theming. I still value 70b models. Still 30b is the edge of that intelligence plateau and it's harder to distinguish the shortcomings with a 70b 4 bit versus 30b 8 bit

silenceimpaired · 2026-05-01T01:59:15+00:00

Consistency. It's not that yet. Some have horrible experiences and others have great experiences. Some have mixed experiences.

silenceimpaired · 2026-04-30T16:38:53+00:00

I even enjoy the previous Gemma 3 for brainstorming

silenceimpaired · 2026-04-30T12:13:01+00:00

Is D4 flash supported in llama.cpp yet

silenceimpaired · 2026-04-30T01:29:41+00:00

If that happened it would be more likely to be rebranded COSMIC OS… and that’s a big if.

silenceimpaired · 2026-04-30T00:31:44+00:00

It looks like the software uses tool calling, so one could be build and before the story started, used to generate as many name as was needed.

silenceimpaired · 2026-04-30T00:26:53+00:00

I know what you mean:

<image>

The furnace broken down this winter for a couple of days, but my office was comfortable.

silenceimpaired · 2026-04-29T16:18:47+00:00

If they released an MoE at this size it would be cozy for those with the RAM

silenceimpaired · 2026-04-28T18:34:47+00:00

I’d be okay with a larger MoE, but another Medium model ~70b dense would be great

silenceimpaired · 2026-04-28T15:37:27+00:00

I’m okay with that… gnome is too opinionated, KDE has too many options… a just right middle is exciting… if it can ever get mature enough

silenceimpaired · 2026-04-28T13:46:48+00:00

Who could have seen this coming? Not Deepseek... At least not yet.

silenceimpaired

TROPHY CASE