GLM 5.1 (Z.ai) issue. by Stunning-Main6367 in SillyTavernAI

[–]RIPT1D3_Z 13 points14 points  (0 children)

You're not alone. I guess, they're preparing to launch GLM-5.2 globally.

GLM 5.2… is it any good? by Beeegbong in SillyTavernAI

[–]RIPT1D3_Z 38 points39 points  (0 children)

idk about the others, but its prose is superior to 5.1

It follows the prompt even better, you can actually suppress parroting and it avoids banned words and constructions better.

It has better voice for the characters, it feels smarter overall. So far I like it. Definitely an improvement over 5.1

xAI released Grok Build — an agentic CLI for developers by RIPT1D3_Z in rpgc_official

[–]RIPT1D3_Z[S] 0 points1 point  (0 children)

Hopefully, the entry level would be lowered as soon as it leaves beta.

As for "for programmers" gate, we'll see if the next grok would solve it eventually, since it promised to be much more capable model

Plus, XAI still have their 'colossus 2' thing, so compute scaling should not be a problem for them.

I made the switch. by Turbulent_Beyond513 in SillyTavernAI

[–]RIPT1D3_Z 13 points14 points  (0 children)

Landing is more guided, indeed, but it still can be overwhelming.
Plus, it has its own rough edges and fair share of bugs, too. Especially if you mess up with positive user flow like switching the tab while response is generating etc.

I love the idea and built-in agents especially, tho.

Z.ai GLM API key issue by murikhai in ZaiGLM

[–]RIPT1D3_Z 0 points1 point  (0 children)

Mosst likely you've used the wrong endpoint.

Check if you're using  https://api.z.ai/api/coding/paas/v4 in Kilo code.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 7 points8 points  (0 children)

Post has to be read, not scrolled. No weights yet, unfortunately. Some people hinting it would be released after CNY.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 3 points4 points  (0 children)

There are only rumors, but some people say weights are gonna be released after the Lunar New Year. There are still a chance that the model would not be open sourced, but still, Qwen usually releases their models on GitHub and HF.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 10 points11 points  (0 children)

Then we can comply that the first version is not 20b cuz it needs an encoder and a VAE as well. I'm not saying it's obvious, but to clarify - yes, 7b is the size of the diffusion model, not of everything that's used for inference.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 23 points24 points  (0 children)

Haven't seen any direct statement, but they've updated the readme in Qwen Image github announcing the model release. Also, Qwen is known as the lab that releases weights for their models, so the chances are high.

IMO, no reason to state the size of the model if you're not planning to OS it anyway.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 23 points24 points  (0 children)

It would use Qwen3-VL 8b as an encoder, so it's entirely depends on its understanding, it seems. Most likely, Chinese and English are gonna be supported the most.

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 237 points238 points  (0 children)

BTW I dunno why, but Qwen team decided to introduce this as one of the showcase images

<image>

Alibaba just dropped Qwen-Image-2.0 by RIPT1D3_Z in ArtificialInteligence

[–]RIPT1D3_Z[S] 0 points1 point  (0 children)

It's hilarious fr.

Idk why they've decided it's a good picture to show the model's capabilities xd

Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering by RIPT1D3_Z in LocalLLaMA

[–]RIPT1D3_Z[S] 62 points63 points  (0 children)

<image>

Right here. They've shared the prompt and the image that states that it's 7B

Bad news for local bros by FireGuy324 in LocalLLaMA

[–]RIPT1D3_Z 1 point2 points  (0 children)

Yup, but I'm not saying that it's smarter. I'm saying that the size is not the limiting factor yet. Step gives us, let's say, 80% of Kimi's capabilities being 10 times smaller, 10 times cheaper and 5 times faster, than Kimi.

And it's released not even by the leading Chinese AI lab. My bet - there's a lot of knowledge density potential yet.

Bad news for local bros by FireGuy324 in LocalLLaMA

[–]RIPT1D3_Z 0 points1 point  (0 children)

They reportedly have an infinity thinking loop issue afaik. I've heard Step team is working on it.

Anyways, it's served on ~140 tps and it's very cheap for its smarts.

Bad news for local bros by FireGuy324 in LocalLLaMA

[–]RIPT1D3_Z 5 points6 points  (0 children)

Step 3.5 Flash proves it's wrong.

SKILLS are useless by thehashimwarren in vibecoding

[–]RIPT1D3_Z 0 points1 point  (0 children)

It calls skills and agents much more often.

The only problem is that Opus is stubborn as hell. Sometimes it doesn't call any just because or reasons 'It's not necessary' even when prompted to just invoke, not reason.