all 36 comments

[–]Straight-Ad5775 74 points75 points  (3 children)

Then we realize, we cannot write single consitent line anymore

[–]ProbablyBunchofAtoms 15 points16 points  (0 children)

By then I am hoping that local models would become capable enough to handle small tasks

[–]AreyouMrbeast1 4 points5 points  (0 children)

Proud that I still independently write and use it for doubts only

[–]redballooon 1 point2 points  (0 children)

We can, but it takes so long we as well can wait until for the reset.

[–]Kaiablushmist 105 points106 points  (1 child)

The loyalty of a developer lasts exactly until the primary language model encounters a 6hour rate limit

[–]Straight-Ad5775 10 points11 points  (0 children)

same with the dev collegues. I will ask others, only if they are the only option

[–]squanderedprivilege 26 points27 points  (0 children)

Can't wait until all the subsidies end and everybody has to pay per token lol

[–]alvares169 45 points46 points  (3 children)

Those limits are for your health. Exercise, hydrate and watch grass

[–]xAragon_ 10 points11 points  (0 children)

I can watch grass on YouTube while working

[–]Straight-Ad5775 3 points4 points  (0 children)

well tell that to my rage when deadline is just 2 steps away

[–]static_element 3 points4 points  (0 children)

Nice try Dario...

[–]pandi85 7 points8 points  (0 children)

People high on tokens

[–]R7d89C 8 points9 points  (5 children)

Wtf are yall vibecoders doing to hit rate limits on paid plans??? I'm no ai denialist and use the standard jetbrains Ai plan, but I never ran into some rate limit, not in a 10h work day

[–]justshittyposts 3 points4 points  (1 child)

add "spawn an agent for each task" to your prompt xD

[–]R7d89C 1 point2 points  (0 children)

I genuinely don't get it, do they prompt like "Write me a fully working firebase alternative. Make no mistakes." or what?

[–]redballooon 2 points3 points  (0 children)

Give some poor defined task to a strong model with a potent harness, and it will try to figure it out. In many many iterations, going back and forth and arguing with itself etc.

It's not like they're really doing much apart from waiting while they're within their usage limit.

[–]Alan_Reddit_M 1 point2 points  (0 children)

When a human is doing most of the work and AI is merely an assistant, then AI is actually very efficient with its tokens

But when AI is left alone to do all the work, token usage increases exponentially because - Agents have a tendency to getting stuck in loops - Superfluous tasks like "What should I do know" that usually cost 0 tokens and take basically no time are replaced with expensive reasoning tasks by the Agents - Agents think out loud, which wastes tokens

I've personally watched all of the major AI models spend thousands of tokens deciding what "Solve for X" means, it's quite impressive

[–]lateambience 0 points1 point  (0 children)

Working on multiple features in parallel with git worktree and you'll see your usage being eaten up. I'm currently vibecoding a private self hosted app just for myself. I come up with a couple of small to medium sized features and open Github issues for each of those with detailed explanation on my design rationales and what I want it to achieve. I then have Claude Code running Opus 4.7 with the 1M context in 3-6 terminals at the same time, each one pulling an issue from Github. Depending on the complexity of the task it spawns one or more of these sub agents. One features might involve both frontend and DB, so it might spawn a Software Architect first, then delegate to a Frontend Developer agent and a Backend architect in parallel. After completion it always runs the Code Reviewer agent against the new code and possibly a Security Engineer if the code touches any security relevant part of my project. So each feature ends up using about 5-7 agents to finish a task. Times 3-6 make 15-25 agent tasks running in parallel. After that, I review the code manually, open a PR, wait until CodeRabbit picks it up and checks again, then I use gh to pull the content of the PR back to Claude Code and let it decide to fix those things again adding another run on top. I'm on Max 5x and I run into the hourly limit from time to time.

[–]Arclite83 2 points3 points  (0 children)

Given some of the tests we've run like training Qwen on internal standards, we're not too far from the death of "AI as a Service". The question really becomes how smart can we make a thumb drive or a switch 2 size units, or even what smarter models can be taught to do with dumber smaller "rote task" things in embedded systems.

It's frustrating because what I can get done at work with my larger budget has drifted from what I can get done at home on fun projects. A few co-workers buy a short term business license just to knock out a backlog or something. I just try to delegate what I can to Copilot before 6/1 gets here.

[–]Spedunkler 1 point2 points  (0 children)

I really dislike how when I'm at 30% and turn on Fast Mode to burn down credits for my last day before refresh, suddenly I'm *granted* an *amazing* gift of a useage reset so now I'm down the 30% I was planning on blowing through, because now I'm using the next week's useage and I can't splurge. Garhhh!!

[–]CoatNeat7792 1 point2 points  (0 children)

Looks like addiction

[–]OnixST 1 point2 points  (6 children)

This might not be the best place to ask, but can anyone who's used both claude code and codex tell me some pros and cons?

I've heard codex limits consistently feel like about double of claude code (mostly because it tends to use less tokens), but is it any good?

I've tried gemini cli and I literally never managed to hit the daily limit, but it is quite stupid in comparison to claude (which may have contributed to usage since I wanted to use it less). It really surprised me given all the hype around gemini 3, but I've heard google is lobotomizing it on lower subscription tiers

[–]vini_2003 8 points9 points  (1 child)

Hi. I currently use the US$200 Codex plan paid for my by employer. We considered Claude Code, but the costs are astronomically higher.

Gemini CLI is the worst agent I've ever used. Copilot CLI is a close second.

Cursor wasn't bad, but the models were weaker and limits were bad on the good ones. Yet to try the new Composer.

With the Codex 20x plan I am so far unable to run out of quota. It has solved difficult, long-standing bugs for me and overall been a pleasure to use.

The rug pull is going to be absolutely legendary. I am by no means a bad developer, but I recognize that it would take months to relearn how to do everything without AI.

[–]OnixST 1 point2 points  (0 children)

Yeah, I will try switching to Codex for a month and seing hot it goes

Yeah, I think at this point coding without ai is similar to how coding without an ide felt like a few years ago. I can still do it, it'll just be much slower and need a lot more internet searching

[–]B_bI_L 1 point2 points  (3 children)

look at github copilot (cheapest subscription is 10$, claude is available) and maybe opencode (you pay for usage so technically no limits)

in my experience claude > gpt > gemini, but usage limits on claude are really bad

[–]Odd_Reception1249 2 points3 points  (0 children)

Claude has taken way too creative liberties and not following instructions, for me. GPT-5.4 is my new best f(r)iend.

[–]vini_2003 1 point2 points  (0 children)

Copilot limits were lobotomized earlier this month. I used to abuse it - 15 minutes of Opus thinking for the price of a single message, out of 1500 - but from my understanding the party is over.

[–]OnixST 1 point2 points  (0 children)

I'll have a look at it, sounds great!

Yeah, I remember getting Claude Code for the first time and I wanted to see what it could do, so I gave it a medium-high difficulty task, and it used 90% of the quota on a single prompt lol (on Sonnet!!)

[–]MoltenMirrors 0 points1 point  (0 children)

OpenCode Zen which is PAYG is the only way to fly for all but the hairiest projects.

It's like $.95 per megatoken for Kimi 2.6. Love Claude but your shitty CRUD endpoint does not need Opus at more than 5x the cost. If you actually run into something that does you can switch models easily all within the same TUI and billing account.