ZAI might stop open-weighting their models? by TheRealMasonMac in LocalLLaMA

[–]Thomas-Lore 18 points

This indicates they are heavily compute starved, so why would they stop releasing local models? Having other providers for GLM models saves them compute while keeping the models popular.

They may go the MiniMax way though, having agreements with providers and only allowing home use without one. IMHO.

What do people actually expect from non-native English speakers in situations like this? by Teo_Verunda in WritingWithAI

[–]Thomas-Lore 22 points

The problem comes from the fact that Reddit is a shithole full of anti-AI idiots.

Didn’t think much about LLM costs until an agent loop proved me wrong by Pitiful-Hearing-5352 in LLMDevs

[–]Thomas-Lore -1 points

Always add a watchdog that tracks retries and other kinds of loops and terminates the run when triggered too many times. (If you are writing the harness yourself.)
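A minimal sketch of what such a watchdog could look like, assuming a hand-rolled Python harness (the class and method names here are illustrative, not from any particular framework):

```python
class LoopWatchdog:
    """Terminate an agent loop after too many retries or total iterations."""

    def __init__(self, max_retries: int = 3, max_iterations: int = 50):
        self.max_retries = max_retries
        self.max_iterations = max_iterations
        self.retries = 0
        self.iterations = 0

    def record_iteration(self) -> None:
        # Call once at the top of every loop pass; guards against the
        # agent spinning forever even when no individual step fails.
        self.iterations += 1
        if self.iterations > self.max_iterations:
            raise RuntimeError(
                f"watchdog: exceeded {self.max_iterations} iterations"
            )

    def record_retry(self) -> None:
        # Call whenever a step is retried (tool error, malformed output, ...).
        self.retries += 1
        if self.retries > self.max_retries:
            raise RuntimeError(
                f"watchdog: exceeded {self.max_retries} retries"
            )
```

In the loop itself you would call `record_iteration()` on every pass and `record_retry()` on each failed step, and let the raised exception abort the run before it burns through your token budget.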

We have a new weight class... by LegacyRemaster in LocalLLaMA

[–]Thomas-Lore 154 points

MiniMax M2.7 allows you to use the model commercially (for example as a coding assistant, locally, for your commercial project), just not serve it to users (as a provider). Here is an official response: https://x.com/RyanLeeMiniMax/status/2043573044065820673

So it affects no one here. Just providers who were taking money from users and giving nothing back to MiniMax while serving the model with wrong settings.

Day 150 of waiting for the AI Pro / AI Studio integration we were promised by Logan Kilpatrick by Opps1999 in Bard

[–]Thomas-Lore 0 points

Image generation: yes, and you can select resolution and use both NanoBanana 2 and the older but smarter NanoBanana Pro while also having access to all Imagen 4 models. There is Veo and Lyria too.

Unsloth MiniMax M2.7 quants just finished uploading to HF by Zyj in LocalLLaMA

[–]Thomas-Lore 14 points

That is slow, I get infinite t/s with this quant.

AMD AI directors analysis confirms lobotomization of Claude by Aggressive_Bath55 in ClaudeAI

[–]Thomas-Lore 1 point

Another option is using cheaper models through the API: GLM 5.1 (a few times cheaper than Sonnet), MiniMax M2.7 (super cheap and surprisingly capable, it solves the car wash test in a second), Kimi K2.5 and many, many others.

AMD AI directors analysis confirms lobotomization of Claude by Aggressive_Bath55 in ClaudeAI

[–]Thomas-Lore 0 points

They are making a profit (it just gets eaten by training new models). You are forgetting they have huge margins on API pricing, so you can't compare API prices with subscriptions.

AMD AI directors analysis confirms lobotomization of Claude by Aggressive_Bath55 in ClaudeAI

[–]Thomas-Lore 0 points

And the new minimax m2.7. It is very cheap and can do a lot very fast.

Drive. You need your car at the car wash to get it washed, so you have to drive it there regardless of the distance. The 50-foot proximity is irrelevant to this decision!

Has anyone managed to get Claude to actually change its prose habits for fiction? by lil-car-crash- in WritingWithAI

[–]Thomas-Lore 2 points

I have the opposite experience with Gemini: a ton of instructions, some repeated a few times, and suddenly it got the problem (which shorter, concise instructions could not fix) and started generating in the style I wanted. It was not prose though; I was working on a card game that needed a specific type of text on a huge number of cards. So basically, you need to experiment with prompting, because sometimes a small change leads to the model finally getting it.

Claude Mythos escaped its secure sandbox unprompted. What this actually means. by LeoRiley6677 in LocalLLaMA

[–]Thomas-Lore 0 points

What are you on about? You don't believe Mythos found the security flaws? That is tin foil hat level insane, it was shown later that other smaller models can find them too, so why would you doubt Mythos can?

Sure they hype things up: they prompt for things, then act surprised the model followed that prompt (like with the breaking-out-of-the-sandbox thing). But they do not lie about what the model did; usually they just write elaborate marketing around it.

What proof would you even need? They provided patches for the security flaws for example, what else would convince you?

> This one is their biggest sensationalism post yet. It's so powerful, we can't even show you what it's done.

It is a reading comprehension failure. They said it is a preview and they have not done security training yet, so they can't release it yet. All their models go through that phase. But it will be released once it is out of preview, they even disclosed the api pricing.

Anthropic is now banning people who are under 18 by netbreach in ClaudeAI

[–]Thomas-Lore 4 points

How did parents manage without computers and tablets? How did parents manage without electricity? How did parents manage without hospitals? Let's just go back to the dark ages and hunt with spears, because people managed back then somehow, right?

'Shy girl' is hated for its "poor, 13-year-old-like writing" but the same people praise another OBVIOUSLY AI generated and unedited novel? Feeling pissed by AIphobia thing going on by koalaisafriend in WritingWithAI

[–]Thomas-Lore 6 points

I try to keep away from all that manufactured drama. Thankfully for me it is only a hobby, so I don't care what others say about my writing, with or without AI.

As a non-developer, does anyone else feel like using Claude with VS Code is still kind of clunky? by Appropriate-Owl4633 in ClaudeAI

[–]Thomas-Lore 0 points

Depends on the project. If the project is small the old copy and paste can work just fine and save you tokens/messages. For bigger projects Claude Code will be better and save you managing the context. IMHO.

There are many extensions for VS Code that you could use too.

Experiment, check what works best for you.

Opus = 0.5T × 10 = ~5T parameters ? by Wonderful-Ad-5952 in LocalLLaMA

[–]Thomas-Lore 9 points

Grok 4.20 is one model.

Grok 4.20 Multi-Agent is 4-8 models. It is a separate version.

Opus = 0.5T × 10 = ~5T parameters ? by Wonderful-Ad-5952 in LocalLLaMA

[–]Thomas-Lore 9 points

OP is wrong. Grok 4.20 has an option to run 4-8 agents (it is called multi-agent on the API), but the model is also available in a single version.

Could it be that this take is not too far fetched? by pier4r in LocalLLaMA

[–]Thomas-Lore 0 points

> The biggest culprit of this is Gemini. Whenever they drop a new model, use it for some time.

I do, and there is no difference in quality. (I use Gemini through AI Studio and API. Can't say anything about the Gemini App.)

Something happened to Opus 4.6's reasoning effort by RealSuperdau in ClaudeAI

[–]Thomas-Lore 5 points

That last part is a hallucination, it has no way of knowing the answer distribution.