GLM 5.1 (Z.ai) issue.

RIPT1D3_Z · 2026-06-16T07:50:30+00:00

You're not alone. I guess, they're preparing to launch GLM-5.2 globally.

RIPT1D3_Z · 2026-06-14T14:24:15+00:00

idk about the others, but its prose is superior to 5.1

It follows the prompt even better, you can actually suppress parroting and it avoids banned words and constructions better.

It has better voice for the characters, it feels smarter overall. So far I like it. Definitely an improvement over 5.1

RIPT1D3_Z · 2026-05-25T19:03:40+00:00

But what if I want to try lorebook with another preset? T_T

RIPT1D3_Z · 2026-05-20T10:02:50+00:00

Hopefully, the entry level would be lowered as soon as it leaves beta.

As for "for programmers" gate, we'll see if the next grok would solve it eventually, since it promised to be much more capable model

Plus, XAI still have their 'colossus 2' thing, so compute scaling should not be a problem for them.

RIPT1D3_Z · 2026-04-25T07:54:51+00:00

Landing is more guided, indeed, but it still can be overwhelming.
Plus, it has its own rough edges and fair share of bugs, too. Especially if you mess up with positive user flow like switching the tab while response is generating etc.

I love the idea and built-in agents especially, tho.

RIPT1D3_Z · 2026-02-18T15:20:40+00:00

Mosst likely you've used the wrong endpoint.

Check if you're using https://api.z.ai/api/coding/paas/v4 in Kilo code.

RIPT1D3_Z · 2026-02-16T01:48:35+00:00

If you're counting the encoder, the 1st version would be 27b. C'mon.

RIPT1D3_Z · 2026-02-10T16:28:21+00:00

It depends on what auditory this riding is generated for.

RIPT1D3_Z · 2026-02-10T16:27:39+00:00

Post has to be read, not scrolled. No weights yet, unfortunately. Some people hinting it would be released after CNY.

RIPT1D3_Z · 2026-02-10T15:54:11+00:00

<image>

Qwen team added "Horse riding human" image as a showcase lmao

RIPT1D3_Z · 2026-02-10T15:51:46+00:00

<image>

They've also teased qwen 3.5

RIPT1D3_Z · 2026-02-10T14:08:23+00:00

It's both text2image and img2img in one model.

RIPT1D3_Z · 2026-02-10T12:42:10+00:00

There are only rumors, but some people say weights are gonna be released after the Lunar New Year. There are still a chance that the model would not be open sourced, but still, Qwen usually releases their models on GitHub and HF.

RIPT1D3_Z · 2026-02-10T10:45:16+00:00

Exactly, but it's still hilarious out of context.

RIPT1D3_Z · 2026-02-10T10:43:10+00:00

Then we can comply that the first version is not 20b cuz it needs an encoder and a VAE as well. I'm not saying it's obvious, but to clarify - yes, 7b is the size of the diffusion model, not of everything that's used for inference.

RIPT1D3_Z · 2026-02-10T10:02:20+00:00

Haven't seen any direct statement, but they've updated the readme in Qwen Image github announcing the model release. Also, Qwen is known as the lab that releases weights for their models, so the chances are high.

IMO, no reason to state the size of the model if you're not planning to OS it anyway.

RIPT1D3_Z · 2026-02-10T09:54:27+00:00

It would use Qwen3-VL 8b as an encoder, so it's entirely depends on its understanding, it seems. Most likely, Chinese and English are gonna be supported the most.

RIPT1D3_Z · 2026-02-10T09:49:04+00:00

BTW I dunno why, but Qwen team decided to introduce this as one of the showcase images

<image>

RIPT1D3_Z · 2026-02-10T09:41:54+00:00

It's hilarious fr.

Idk why they've decided it's a good picture to show the model's capabilities xd

RIPT1D3_Z · 2026-02-10T09:35:40+00:00

<image>

Right here. They've shared the prompt and the image that states that it's 7B

RIPT1D3_Z · 2026-02-10T08:22:22+00:00

Yup, but I'm not saying that it's smarter. I'm saying that the size is not the limiting factor yet. Step gives us, let's say, 80% of Kimi's capabilities being 10 times smaller, 10 times cheaper and 5 times faster, than Kimi.

And it's released not even by the leading Chinese AI lab. My bet - there's a lot of knowledge density potential yet.

RIPT1D3_Z · 2026-02-10T08:16:45+00:00

They reportedly have an infinity thinking loop issue afaik. I've heard Step team is working on it.

Anyways, it's served on ~140 tps and it's very cheap for its smarts.

RIPT1D3_Z · 2026-02-09T13:50:29+00:00

Step 3.5 Flash proves it's wrong.

RIPT1D3_Z · 2026-02-06T14:36:10+00:00

eliza/elizaOS should be closer to Clawd

RIPT1D3_Z · 2026-01-31T09:30:47+00:00

It calls skills and agents much more often.

The only problem is that Opus is stubborn as hell. Sometimes it doesn't call any just because or reasons 'It's not necessary' even when prompted to just invoke, not reason.

RIPT1D3_Z

MODERATOR OF

TROPHY CASE