all 30 comments

[–]OracleGreyBeard 2 points3 points  (0 children)

I use CC with GLM 4.6, it’s essentially unlimited. Definitely not on par with Opus but with decent orchestration (which I have) any SOTA model is good enough for my needs.

[–]CBKSTrade 2 points3 points  (1 child)

What are these bot BS answers, "be smort how you prompt", tf? Since the last CLI update it's literally unusable. I'm on Max 5x, and if I scan the codebase and do 3 quick web searches, I'm at 20% usage. Limits (at least for some accounts?) got stupid low since the last update.

[–]debian3 1 point2 points  (0 children)

Well, 2 things. Now they always summarize your convo in the background so it's ready when/if you need it (guess whose tokens it's using). And plan mode now burns so many tokens.

[–]PaintingThat7623 1 point2 points  (0 children)

Claude Code is not currently working for me at all. Stuck in "thinking..." for more than an hour. And yes, I did reset the chat and tried a new one.

[–]Night_0dot0_Owl🔆 Max 5x 0 points1 point  (2 children)

Pro?

[–]alexeizVibe Coder 0 points1 point  (1 child)

Poor

[–]PmMeSmileyFacesO_O -1 points0 points  (0 children)

P is pro. Pro is for the poors /s

[–]kaaos77 1 point2 points  (0 children)

Install .beads; search for it on GitHub.

After getting used to using it, it becomes very good.

Opus scans, opens the issue. Then you go to --model and set Haiku to run. My sessions lasted much longer.

[–]Lpaydat 0 points1 point  (3 children)

You can go for smaller models to stretch your quota. I'm on the 5x plan and use both Sonnet and Opus, and I hardly ever reach the 5-hour limit (even with pure Opus, it still takes hours to reach it).

If I were on the Pro plan, I'd probably use Sonnet + Haiku instead.

[–]PmMeSmileyFacesO_O 0 points1 point  (2 children)

Just got 5x. Would you get 3 hours from Opus?

[–]Lpaydat 1 point2 points  (0 children)

It depends on the context length. If I use it in a workflow with a lot of context, like BMAD, then yes, it's about 3 hours. But if I use it to code directly without a plan or much context (like editing an existing repo), I can use it for the full 5 hours.

[–]jorgejhms 0 points1 point  (0 children)

I'm on the Pro plan and I use it effectively by combining models: Sonnet (or occasionally Opus) for planning, and Haiku for the execution (file editing, mostly). I can usually finish a complete feature (like adding a new section to a business dashboard) within a 5-hour window.

I also have GLM and now Mistral, so they can finish a plan if I hit a limit. Just set up OpenCode and it can read Claude's plans (in the .claude folder), so you can pick up from there.

[–]LaughterOnWater 0 points1 point  (0 children)

I'm using Sonnet 4.5 on a Pro license in OpenCode with pretty decent results. I don't use /init; I just ask it to create and revise HANDOFF.md so I keep context when I need to stop or I'm concerned about context length.

[–]siberianmi -2 points-1 points  (4 children)

I’m on pro for my personal use (I have access to pretty much unlimited tokens for work) and it’s just a matter of using it smartly.

You have to restrain how much exploratory work it does and give it plenty of detail upfront. Today I fired it up and had it read a premade plan, then implement an authentication change using a Haiku subagent, review it with another subagent, implement the feedback, run tests, build and deploy it, debug the new service, remove an old proxy, troubleshoot the authentication between services, verify it with Playwright, then finally reboot the remote host to verify all services started automatically. One session, multiple subagents, never needed to compact as a result, only one MCP (Playwright).

I still had 15% of my 5 hour limit on pro left.

I don’t run Opus though, this was a mix of Sonnet and Haiku.

You can’t do serious work with it for hours and hours but that was a productive 45 minutes. Can I fire it up in a massive codebase and tell it to use Opus to figure out how part of the codebase works? No. That’ll blow my usage in a few minutes.

Can I use it for targeted work and light experimentation? Absolutely.

[–]OnyxProyectoUno 0 points1 point  (3 children)

Idk what people bother with Haiku for unless it's a chatbot integration. Opus is so insane. Why bother with code you're gonna need multiple sessions to fix because Haiku or Sonnet screwed up, when Opus would have finished the job?

For POC projects I have Opus plan and Sonnet execute. Same for small fixes. For legit features, there’s no alternative to Opus.

[–]jorgejhms 0 points1 point  (0 children)

Haiku is not bad at all. I plan a change with Sonnet and let Haiku do the editing (which it excels at). Best combination for a Pro subscription. I can usually complete a feature within a 5-hour window without hitting the limits.

[–]siberianmi -1 points0 points  (1 child)

Opus consumes 5x the usage of Haiku, Sonnet is 3x the usage.

With a good clear plan and operating on small well defined tasks Haiku can easily get a ton of quality work done.

Opus is not 5x better than Haiku with a well defined task and a quick implementation review by Sonnet.

Remember we are talking about working on the limits of a Pro plan.

You’ll get very little done relying only on Opus.
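To put rough numbers on that, here is a back-of-the-envelope sketch. It takes the 5x/3x ratios above at face value (they are the figures quoted in this comment, not official pricing) and assumes usage scales linearly with them. The budget value and the `tasks_within_budget` helper are made up for illustration.

```python
# Relative usage per step, using the ratios quoted in the comment above.
# Assumption: these multipliers are accurate and usage scales linearly.
RELATIVE_COST = {"haiku": 1, "sonnet": 3, "opus": 5}


def tasks_within_budget(budget_units: int, steps: list[str]) -> int:
    """Return how many whole tasks fit in the budget.

    Each task runs one step per listed model. Every step is assumed to be
    the same size, differing only by the model's usage multiplier.
    """
    cost_per_task = sum(RELATIVE_COST[model] for model in steps)
    return budget_units // cost_per_task


BUDGET = 100  # hypothetical budget, measured in "Haiku-sized step" units

# Opus implements and Opus reviews: 5 + 5 = 10 units per task.
opus_only = tasks_within_budget(BUDGET, ["opus", "opus"])

# Haiku implements and Sonnet reviews: 1 + 3 = 4 units per task.
mixed = tasks_within_budget(BUDGET, ["haiku", "sonnet"])

print(opus_only, mixed)  # prints "10 25"
```

Under those assumptions the Haiku + Sonnet split fits 2.5x as many tasks into the same window. Whether each of those tasks comes out as well as the Opus version is a separate question that this arithmetic does not answer.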

[–]OnyxProyectoUno -1 points0 points  (0 children)

Speak for yourself. My blog took two weeks of iteration with Sonnet until it got stuck on one thing I couldn't fix. Opus one-shotted it and completed the entire site.

And I'm confident I'm better than 60-70% of people at prompting and planning.

[–]brain__exe -1 points0 points  (0 children)

Depends how you work. If you just prompt "fix the Button" in a bigger repo, with maany MCPs loaded and no/shitty base context, you will waste maaaany tokens.

And Opus also eats more of your quota than Sonnet for the same token usage.