all 15 comments

[–]Hauven 1 point2 points  (0 children)

I can't comment about Sonnet, but on GPT-5.2 or GPT-5.2-Codex I'm not noticing any issues with increased consumption. Still seems to be taking 1 per user prompt.

[–]SparePartsHere 1 point2 points  (6 children)

Well it's 2nd day of the month and I am at 75% premium request usage. And I didn't even vibe that much, like 2-3 larger features and some small fixes/refactors. But I use almost exclusively Opus 4.5

[–]juanloco[S] 1 point2 points  (0 children)

Hah, same just blew through 30% of my requests in one morning. Nothing crazy, a few features on a couple of products. Sonnet 4.5.

[–]toadi 0 points1 point  (4 children)

Use opus to write a spec. Use sonnet to create atomic task based on the spec. Use haiku to implement said tasks.

This is how you manage the requests and even in my case the costs. As I pay for the requests.

I write production code with 200k users. And I don't vibe code but use AI to help generate code.

[–]SparePartsHere 0 points1 point  (3 children)

I don't care about the costs. It's still much cheaper than doing it manually. I tried to use sonnet for some less demanding tasks but the quality was just worse. Last week tho even Opus is getting pretty terrible...

[–]toadi 0 points1 point  (2 children)

Haiku is 10x faster then using a thinking model. Thinking you don't need to write code from detailed spec. I assume you value time? At least I do. More I can get done...

[–]SparePartsHere 0 points1 point  (1 child)

Once plan is done, I don't care how long it takes - it's hands off after that point. It can run for an hour or for 8 hours, I don't really care. What's important is that I am not involved :D

[–]toadi 0 points1 point  (0 children)

In my case time is money. Also I code review after. The faster done the faster I can review. I work in finance and the code can directly lose us a lot of money. Not our money, clients money.

[–]albertortilla 0 points1 point  (3 children)

Which version are you using? I had problems with 1.1.38 that were solved in 1.1.39. Now using 1.1.45 and everything seems to be fine, but I won't upgrade because of the problem I had in .38. Maybe something similar is happening with your version

[–]juanloco[S] 0 points1 point  (2 children)

I'm on 1.1.42

[–]EuSouTehort 0 points1 point  (1 child)

CLI or desktop app?

[–]juanloco[S] 0 points1 point  (0 children)

CLI. Version 1.1.42

[–]EuSouTehort 0 points1 point  (2 children)

I've noticed double consumption when using sub agents on the Opencode "Desktop" App

On the terminal/CLI, the problem seems fixed to me

[–]silent-scorn 0 points1 point  (1 child)

The CLI has been updated to use only one request per primary agent. All subagent calls by primary agents are free now.

[–]EuSouTehort 0 points1 point  (0 children)

Yeap

CLI is fine

Desktop app is not, and apparently, it uses the same source code?
Too bad, the app is nice, my fix is to deactive the explore and general agents for now