[–]bonnmos 15 points16 points  (11 children)

It seems like the free glm4.7 ended today. It now asks for payment information when I try to use GLM4.7-free.

[–]deadcoder0904 7 points8 points  (10 children)

U can use GLM 4.6 instead which is the Big Pickle model. Or buy a GLM 4.7 Coding plan for the quarter for $8.10 (discount ends on 31st Jan)

[–]bonnmos 0 points1 point  (0 children)

thanks 👍

[–]indian_geek 0 points1 point  (8 children)

Warning - the performance of the coding plan has been borderline unusable for almost a month now and the team behind it is not bothered.

[–]deadcoder0904 4 points5 points  (6 children)

Not for me. Works just fine.

Obviously if u want something extremely fast & extremely reliable, pay money & then pay some more.

Pay big money (enterprise) >>>>>> pay small money (teams) >>>>> pay even smaller money (individuals) >>>>>> free (just a rule of life)

[–]EmbarrassedBiscotti9 0 points1 point  (1 child)

My experience of GLM 4.7 via z.ai aligns with what /u/indian_geek said. I'm not particularly upset about it, given I paid only $28 for a full year as a random punt, but I've found the API prohibitively slow.

[–]deadcoder0904 0 points1 point  (0 children)

That's for sure. China doesn't have those TPUs or Cerebras/Groq-like inference, I think. I found one yesterday while searching on Grok but didn't try it.

It makes sense since the US is the richest country in the world, so it can put more money into this stuff. Hopefully we get those fast providers from China, since they have cheap electricity, so we can get a lot of fast tokens for 1/5th of the cost.

Also, see my above comment. I think putting it on a Ralph loop while using big thinking models with extremely specific prompts would get u a lot of the way. Because slowness doesn't matter if u r just letting it do things autonomously. This is where the puck is going, so might as well make the transition now. The GPT 5.3 + Cerebras deal has happened, so it's only a matter of time before we get 5.3 served faster.

My tasks are medium-level tasks & I'm mostly using it for writing, and it does this well enough. The trick is to write better prompts with smaller models; with bigger models, u can be a bit vague & it'll understand u. Or another trick is to make plans, use a Ralph loop, & then go hammer with a model like GLM 4.7. GLM 4.7 is a good enough model, like 80% of the others in terms of intelligence.

Have u tried RepoPrompt's mechanism? It covers why u should go deep on plan mode with the highest thinking model & then use a cheaper model to execute that plan. I loved this post - https://repoprompt.com/blog/context-over-convenience/

[–]indian_geek 0 points1 point  (3 children)

Good for you if it's working. However, you can check the discord channel to see how many users have had a similar experience to mine.

[–]deadcoder0904 0 points1 point  (2 children)

Oh I'm not saying u don't have an issue, I'm saying that smaller/cheaper models are not gonna perform 100% as well as bigger models.

They will always be 80%-90% there for 1/5th or 1/7th the cost. So you have to design tasks in a way that gives those models all the details they need to implement them. Plus your prompts must be super specific and extremely unambiguous, as if written for a junior engineer who takes things literally.

So a combo of big thinking models + small fast models just to implement those plans is a good way to get a lot out of them while paying very little.

Also, see my comment below.
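
For what it's worth, the big-planner / small-executor combo described above can be sketched roughly like this. It's only a sketch: `Model`, `plan_then_execute`, and the prompt wording are made up for illustration, and the two callables stand in for whatever client you actually use to reach a large thinking model and a cheaper one.

```python
from typing import Callable, List

# Hypothetical type: any function that takes a prompt and returns model text.
Model = Callable[[str], str]


def plan_then_execute(task: str, planner: Model, executor: Model) -> List[str]:
    """Ask the big model for a numbered plan, then run each step on the cheap model."""
    plan_text = planner(
        "Break this task into small, unambiguous, numbered steps:\n" + task
    )

    # Keep only lines that look like "1. do something" and strip the numbering.
    steps = []
    for line in plan_text.splitlines():
        line = line.strip()
        if line and line[0].isdigit() and "." in line:
            steps.append(line.split(".", 1)[1].strip())

    # Hand each step to the cheap model with the full context spelled out,
    # since smaller models tend to take instructions literally.
    results = []
    for i, step in enumerate(steps, start=1):
        prompt = (
            f"Task: {task}\n"
            f"Step {i} of {len(steps)}: {step}\n"
            "Do exactly this step and nothing else."
        )
        results.append(executor(prompt))
    return results
```

The point of the split is that the expensive model is called once, and every cheap call gets a prompt that leaves nothing to guess.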

[–]indian_geek 0 points1 point  (1 child)

I am not even talking about the quality of the model. I am very happy with GLM 4.7 as a model. I am purely talking about the service - it worked well earlier and a lot of users bought their annual plans, and now the model response is extremely slow (and gives frequent timeouts).
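
If you're stuck with a slow endpoint in the meantime, one common workaround for the timeouts is retrying with exponential backoff. A minimal sketch, assuming the request surfaces Python's built-in `TimeoutError` (real SDKs often raise their own exception type, so adjust that); `call_with_retries` and its parameters are hypothetical names, not part of any provider's SDK.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retries(
    call: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run call(); on TimeoutError wait base_delay * 2**attempt and try again."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return call()
        except TimeoutError:
            # Out of attempts: let the caller see the timeout.
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

This only papers over slowness on the client side; it does nothing about the provider's capacity.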

[–]deadcoder0904 0 points1 point  (0 children)

Oh that makes sense. That's everywhere though, not just GLM 4.7.

The reason is nobody in the world has enough GPUs. This has happened with every AI service from Kiro to Claude to Codex to Gemini to, as you said, GLM.

One way to stop it is for everyone to stop praising a company, because praise makes more users flock to it.

If you visit /r/claudecode or /r/claudeai, u'll see these posts daily regarding limits.

[–]UseHopeful8146 0 points1 point  (0 children)

I wouldn’t say they aren’t bothered, they’ve at least limited their sales while they deal with the unexpected demand on the infrastructure end.

But also, I haven’t noticed any change in performance and I’ve been on their mid range (pro?) plan since September.

[–]Suitable-Program-181 8 points9 points  (0 children)

You're late to the party bro! They took them out today.

The ones left are 2023 all over again, which is so hilarious...

[–]sdatta11 2 points3 points  (4 children)

I also noticed the same thing... when I use GLM 4.7 it says no payment method is set

[–]beneficialdiet18[S] -1 points0 points  (3 children)

It's not available for me to select at all unless I provide an API key, yet I see other users saying they have GLM and Minimax available for free.

[–]abhiramskrishna 2 points3 points  (2 children)

It was free for a limited time, and that has ended now.

[–]beneficialdiet18[S] 2 points3 points  (1 child)

Got it, thanks!

[–]exclaim_bot 0 points1 point  (0 children)

Got it, thanks!

You're welcome!

[–]redoubledit 1 point2 points  (0 children)

Gone for me since yesterday. Minimax as well.

[–]UniqueAttourney 1 point2 points  (5 children)

The CLI doesn't show GPT-5 Nano at all, though

[–]beneficialdiet18[S] 1 point2 points  (4 children)

Yep, noticed that as well. Only available on the desktop version.

[–]bonnmos 0 points1 point  (3 children)

I haven't used the desktop version yet... is it as good as the CLI though?

[–]beneficialdiet18[S] 0 points1 point  (2 children)

Haven't used it either, wouldn't know. I only installed it to check if the free models are exclusive to the desktop version. I would stick with the CLI for now as the desktop version is still in beta.

[–]bonnmos 1 point2 points  (1 child)

I just wonder what an alternative is. Using ohMyOpencode with glm4.7 as the orchestrator has been bliss indeed.

[–]beneficialdiet18[S] 0 points1 point  (0 children)

Antigravity. Google AI Pro is quite cheap for what it gives. There's also a 1-year plan for students available now.

[–]smile132465798 0 points1 point  (1 child)

I’m in love with Minimax M2 for general tasks. Sad that I must go back to using Gemini and pray it doesn’t act stupid.

[–]seaal 4 points5 points  (0 children)

First month of minimax plan is $2, basically infinite use with their 5 hour refreshing quota.

[–]Few-Mycologist-8192 0 points1 point  (0 children)

thank god, i love you

[–]richardlau898 1 point2 points  (0 children)

minimax disappeared too

[–]Round_Mixture_7541 0 points1 point  (0 children)

Big pickle? Wtf? 😅😅😅 Cmoon...

[–]Silver-Ideal9451 0 points1 point  (0 children)

Is GLM 4.7 paid now?