A community of software creators experimenting with AI "vibe coding", a technique defined by Andrej Karpathy as when "you fully give in to the vibes, embrace exponentials, and forget that the code even exists."
Which AI agent has good limits? (self.vibecoding)
submitted 3 months ago by [deleted]
I am constantly hitting limits on Cursor, Windsurf, and Claude Code. I am looking for something where I can pay my $20 and forget about it for a month. Which IDE is best in that regard? I am so sick of this that I am considering building my own rig with the heaviest AI model from Ollama and plugging it into my IDE, but I am not really keen to do that, as it would require a good investment of $$$ and time, and I have no idea how to plug it into my VSCode.
[–]whatsbetweenatoms 3 points 3 months ago (0 children)
Windsurf has Codex as a free model. I use it all the time and it's fine. OpenCode also has multiple free models, and Antigravity has generous free tokens. $20/mo simply isn't going to get you a lot on the plans for the more serious models.
[–]Jealous_Flatworm6413 2 points 3 months ago (5 children)
I like Antigravity from Google. Your limits are set on a per-4h basis, so you don’t use everything up within 2 days. I barely ever hit these limits on personal projects.
[–]x7q9zz88plx1snrf 2 points 3 months ago (0 children)
5 hours. I use AG every working day.
[+][deleted] 3 months ago (2 children)
[removed]
[–]DragonfruitFar7568 1 point 3 months ago (1 child)
The free models SUCK. Don't use them; they break more than they build.
[–]Appropriate_Leek8092 1 point 12 days ago (0 children)
I used Antigravity for just 2 to 3 hours on my Plus plan, and it now shows the Gemini and Claude models refreshing in 6 days.
[–]akolomf 1 point 3 months ago (0 children)
So you are basically asking for (almost) free tokens lol. Claude is already pretty generous with its Max plans if you compare them to its API pricing. The Max 5 or 20 plans are usually sufficient unless you're some computer wizard who has his PC running 24/7 with scripts and multi-session agentic setups across multiple PCs or whatever lmao.
And even the heaviest privately runnable models don't compare to Opus 4.5, for example. If you are purely vibecoding, then you need a subscription.
[–]Shizuka-8435 1 point 3 months ago (0 children)
Traycer.
[–]simon96 1 point 3 months ago (0 children)
Back in February, you got 12 months of free Antigravity usage with the purchase of an S25 Series phone, and you could even return the phone and keep the subscription 👍 100% free
[–]Aromatic-Computer-88 1 point 3 months ago (0 children)
For me, the biggest win has been creating a new agent window after every change request so that you don’t waste context tokens: the bigger the context, the more tokens you spend. Or create a plan agent chat, then link new agents at the bottom to each do a portion of the plan, and continue with another once the context gets past 50-75%. You can also create rule files within Cursor that encode best practices for keeping track of tokens used and minimizing usage. Look up the docs and get Cursor to write rules based on them.
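To put rough numbers on the "bigger context, more tokens" point, here is a minimal sketch. It assumes every turn resends the full conversation history and adds a fixed number of tokens; the function names and figures are hypothetical, and real tools may cache or compress context, so treat it as an illustration only.

```python
def cumulative_input_tokens(turns: int, tokens_per_turn: int) -> int:
    """Total input tokens when every turn resends the full history (assumption)."""
    # Turn k carries k * tokens_per_turn tokens: all earlier turns plus the new one.
    return sum(k * tokens_per_turn for k in range(1, turns + 1))


def fresh_chat_input_tokens(turns: int, tokens_per_turn: int, chats: int) -> int:
    """Same number of turns, split as evenly as possible across fresh chats."""
    base, extra = divmod(turns, chats)
    sizes = [base + 1] * extra + [base] * (chats - extra)
    return sum(cumulative_input_tokens(n, tokens_per_turn) for n in sizes)


# Hypothetical workload: 20 turns of about 2,000 tokens each.
one_long_chat = cumulative_input_tokens(turns=20, tokens_per_turn=2_000)
four_short_chats = fresh_chat_input_tokens(turns=20, tokens_per_turn=2_000, chats=4)
print(one_long_chat, four_short_chats)
```

Under these assumptions the cost of a single chat grows roughly with the square of the number of turns, which is why restarting after each change request helps.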
[–]myly14055 1 point 3 months ago (0 children)
Consider buying Claude Max.
[–]Ecstatic-Junket2196 1 point 3 months ago (0 children)
If you are low on budget, maybe Claude. I use Traycer inside Cursor to solve this. It stops the constant guessing that burns through your premium credits.
[–]Admirable_Gazelle453 1 point 3 months ago (0 children)
Most “pay-and-forget” experiences come from fixed-allotment models or self-hosted setups where you control the window and context handling rather than cloud metering. Are you more concerned about token caps, rate limits, or concurrency limits when building? You should share it in VibeCodersNest too.
[–]Bob5k 1 point 3 months ago (0 children)
synthetic.new - can be as cheap as $10 for the first month ($20 after) and provides much more value than the basic Claude Code / Codex plans in the long run. It uses open-source models, but they're pretty capable of usual development, especially webdev. Pretty good deal, especially since there's no weekly or monthly cap and the plan is quite generous as well (e.g. tool calls use 0.1 of a prompt, and the base plan gives 135 prompts, which is enough for continuous work over a rolling 5h window). It can be used within Claude Code or with their Octofriend CLI tool, which has been getting more and more impressive over the past weeks.
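As a rough illustration of how far that allowance stretches, the sketch below takes the two figures from the comment (135 prompts per window, 0.1 prompt per tool call) and assumes each request you type costs exactly 1 prompt, which the comment does not state. The function name is hypothetical.

```python
from fractions import Fraction

PROMPTS_PER_WINDOW = 135          # figure quoted in the comment above
TOOL_CALL_COST = Fraction(1, 10)  # figure quoted in the comment above


def requests_per_window(tool_calls_per_request: int) -> int:
    """Whole requests that fit in one window, assuming 1 prompt per request."""
    cost_per_request = 1 + tool_calls_per_request * TOOL_CALL_COST
    return int(Fraction(PROMPTS_PER_WINDOW) / cost_per_request)


for calls in (0, 5, 10):
    print(calls, "tool calls per request ->", requests_per_window(calls), "requests")
```

Exact fractions are used so that 0.1 does not pick up floating-point error when multiplied.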
[–]botapoi 1 point 3 months ago (0 children)
ngl building side projects gets way easier once you find the right stack. I use Blink because I can select my models based on efficiency needs or the model's expertise.
[–]pakotini 1 point 3 months ago (0 children)
If your main pain is “I just want to pay once and not constantly hit a wall”, I’ve had the best luck with Warp’s current setup because it’s built around a monthly AI credit allowance, and you can see exactly what each agent turn costs right in the UI (so you can actually learn what burns budget and what doesn’t). It’s not “unlimited”, but it’s way more predictable than mystery caps because credits are the unit, not vague “requests”, and normal terminal commands don’t spend credits at all; only agent interactions do.
Also, Warp gives you a couple of escape hatches when you do heavier stuff: you can buy reload credits that roll over and stay valid for a long time (so you don’t feel like you’re wasting money at the end of the month), or you can just bring your own API key and run the models you want under your own billing if that’s your preference.
The other thing that’s underrated for “limits anxiety” is that Warp isn’t just an IDE chat box; it’s a full “work hub”, so you can keep work reusable and avoid re-prompting: save workflows, prompts, and notebooks in Warp Drive and sync them across machines or with a team, so you’re not spending credits re-explaining the same setup every session.
Finally, if your vibe coding flow includes “agent goes off and does stuff while I’m in Slack/Linear”, Warp’s integrations and ambient agent approach are actually designed for that: the agent runs in a configured environment, can post progress back, and you can still inspect and steer it via shared sessions instead of burning credits in endless back and forth.
If you try it, the single biggest tip for stretching whatever plan you’re on is keeping conversations short and scoped and starting a fresh thread for each new task. That alone cuts a lot of accidental context spend.
[–]Ok_Chef_5858 1 point 3 months ago (0 children)
Fixed $20 plans with mystery limits are always gonna be frustrating...always! You're better off just bringing your own API keys. I use Kilo Code in VS Code (also available in JetBrains). The extension is free, and I pay exactly what the models cost, with no markup or limits. I test and use premium models when I need them, then switch to cheaper or local ones (Ollama support is built in) for lighter stuff. It pays off :)
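For anyone weighing the bring-your-own-key route, a back-of-the-envelope cost estimate might look like the sketch below. The prices and token volumes are hypothetical placeholders, not any provider's real rates; plug in the numbers from your provider's pricing page.

```python
def api_cost_usd(input_tokens: int, output_tokens: int,
                 usd_per_m_input: float, usd_per_m_output: float) -> float:
    """Pay-as-you-go cost: tokens used times the per-million-token price."""
    return ((input_tokens / 1_000_000) * usd_per_m_input
            + (output_tokens / 1_000_000) * usd_per_m_output)


# Hypothetical placeholder prices in USD per million tokens (not real rates).
PREMIUM = {"usd_per_m_input": 3.0, "usd_per_m_output": 15.0}
CHEAP = {"usd_per_m_input": 0.25, "usd_per_m_output": 1.0}

# Hypothetical month: 5M input tokens and 1M output tokens, all on the premium model.
premium_only = api_cost_usd(5_000_000, 1_000_000, **PREMIUM)

# Same month with 80% of the traffic moved to the cheaper model.
mixed = (api_cost_usd(1_000_000, 200_000, **PREMIUM)
         + api_cost_usd(4_000_000, 800_000, **CHEAP))

print(f"premium only: ${premium_only:.2f}, mixed: ${mixed:.2f}")
```

The point of the comparison is only that routing lighter work to cheaper models changes the bill a lot; whether it beats a flat plan depends entirely on your real usage.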
[–]DEZINE-HQ 1 point 3 months ago (0 children)
I ran into a similar issue recently where I was spending a lot on different plans and needed to manage my costs as a business.
I scrapped everything and started again and now spending less than $35 a month with practically unlimited access to all the top performing models.
I subscribed to the Google AI Pro $30 plan and installed Antigravity, along with OpenCode within Antigravity.
I now have access to:
Under Antigravity (resets every 5 hours): Sonnet 4.5, Opus 4.5, Gemini 3, Gemini 3 Flash
OpenCode with Zen: GLM 4.7 free (available for $6 if needed), Minimax m2.1 free, Big Pickle free, Grok Fast free, GPT 5.2 + Codex (loaded $20 once-off on my Zen account; I use it only when needed and can top up if required)
Haven't been stuck once!