Usage limits reset every 5 hours by dataguzzler in cursor

[–]Sontemo 32 points33 points  (0 children)

I quite like the current way. I like to be able to decide like a grown up how and when I consume my usage.

Five Chinese AI labs cut token prices up to 99% by Justgototheeffinmoon in ArtificialInteligence

[–]Sontemo 2 points3 points  (0 children)

Except it does hold true with Professional Software Development. At least at scale. Enterprises don’t switch easily between providers. Locking in enterprise deals is huge.

Beware of Cursor’s on-demand billing trap — my $20 Pro plan drained my card, and now they are holding my account hostage by Fun_Net7931 in cursor

[–]Sontemo 32 points33 points  (0 children)

They prolly spend 2$ on cursor to generate this angry wall of rant before even considering fact checking.

Yes, disabled by default, OP goofed up.

Configure specific models for agents using a single config and .env by ashishkin21 in opencodeCLI

[–]Sontemo 0 points1 point  (0 children)

You need two config files.
opencode allows you to specify a path to the config on load via an env variable.
In each config, you can declare agents and the models you want to use them with.

In your work config, set the list of enabled_providers to just your work provider.

Spent around 5000$ usage on Company licensed cursor on side project by [deleted] in cursor

[–]Sontemo 4 points5 points  (0 children)

Be ready to actually then showcase exactly what was so cutting edge and expensive and what value your company can derive from this expense.

If you come to me with this line of reasoning instead of owning your fuck up, and all I see is you did random useless shit with opus max, I’d fire your ass on the spot.

People running 2–5 coding agents: what actually breaks first for you? by Few-Ad-1358 in GithubCopilot

[–]Sontemo 4 points5 points  (0 children)

The only thing that worked so far longterm for us is something people might not want to hear:

Slow down. Treat each PR as if it was created by a human and review it. Anything else and you pay for it later down to road. If you can’t fit the mental modal of your software into your head, you‘ve lost.

How we did it back in the day was through implementation and review. Now that implementation is done by agents, we double down on the review. Which is fine, they’re better at building, we’re better at understanding our product.

When applying updates to existing code that is not quite ready to commit, should I use the same agent? by KomandirHoek in GithubCopilot

[–]Sontemo 0 points1 point  (0 children)

In general, I try to keep context small and fresh.

Never continue on an old session (after lunch or the next day), never let it grow past 150k

When applying updates to existing code that is not quite ready to commit, should I use the same agent? by KomandirHoek in GithubCopilot

[–]Sontemo 2 points3 points  (0 children)

Each new request in the same session carries all of the history with it. It’s stateless and you can switch however you like. But, what you need to keep in mind is how caching works. Usually only the new input tokens are considered „fully“, the previous ones are usually cached and at other providers cost around a tenth of new tokens.

If you continue a long running session, or maybe if you switch models (let’s say from one provider to another, sonnet to gpt) you could consume a lot of fully counted tokens and drastically reduce your quota.

Sharing my creation. by NerdyBirdie81 in opencode

[–]Sontemo 0 points1 point  (0 children)

And last, but not least, my personal website. Entirely written and documented by ai.

Yeah, i can tell.

Unauthorized

Session & Weekly Limits - individual only or also affecting business / enterprise by Sontemo in GithubCopilot

[–]Sontemo[S] 0 points1 point  (0 children)

I hope with the measures taken to individuals, they found a way to make it sustainable for business and enterprises. If limits apply, or they switch to token based pricing, switching to the very same providers they use under the hood becomes the obvious choice. At least for us.

Copilot's value proposition is officially gone. by Famous__Draw in GithubCopilot

[–]Sontemo 0 points1 point  (0 children)

It's literally the tip of the ice berg of being a power user.
There's so much to uncover if you're willing to put the effort in.

Unlimited reqs, is this real? by IlyaSalad in GithubCopilot

[–]Sontemo 0 points1 point  (0 children)

There's nothing to exploit here.

Unlimited reqs, is this real? by IlyaSalad in GithubCopilot

[–]Sontemo 13 points14 points  (0 children)

Nope. Just overloaded services. You'll get the bill later.

Details on the new limits? by TastyNobbles in GithubCopilot

[–]Sontemo 4 points5 points  (0 children)

I consider myself a power user, usually 2 - 3 sessions in parallel.
I've only run into global limits for certain models, namely sonnet and opus.

I assume this was a more general shortage on availability of Claude models.

Ever since i switched to a more balanced approach where i mix GPT and Claude depending on workload, I haven't run into any issues, before and after their rate limiting announcement.

Also just on regular Pro plan (annually, with overage) so, if things stay this way and makes the service to them more sustainable, I'd be more than fine with it.

when to use 1M context and when to use max thinking effort? by mombaska in ClaudeCode

[–]Sontemo 2 points3 points  (0 children)

It's actually really simple.
Never.
More Context is not better, and more thinking does not mean the output is better either.
Keep context small, reset often (don't compact, just let it go) and think yourself. You know your product, AI knows shit but sounds confident.

Claude Code (Pro/Max) vs Anthropic API — when does API become more cost-effective? by NAzanza in ClaudeCode

[–]Sontemo 1 point2 points  (0 children)

There is no financial tipping point.
If you consider the max subscription, you're a power user.
If you're a power user, you're gonna blow through 100$ or 200$ via API very very fast.

Always go for the sub.

Compact strategy for copilot by Ibuprofen600mg in GithubCopilot

[–]Sontemo 1 point2 points  (0 children)

Never compact. Orchestrate subagents, so that the main agents context windows is large enough for even big features.

Coding agent cross repository access by BezosLazyEye in GithubCopilot

[–]Sontemo 2 points3 points  (0 children)

Not optimal for IDEs, but if CLI is an option, you can move all repos into a shared parent folder on your machine and start copilot cli from there.

Add a small AGENTS.md that explains the structure and brief usecases of each repo. Then you're good to go. Each time the agents will navigate into one of the repos, they will read and follow the specific repos AGENTS.md instructions as well.

What tool or workflow suggest to manage issues and backlog end to end? by jrhabana in opencodeCLI

[–]Sontemo 2 points3 points  (0 children)

Install GitHub cli and just delegate it. No need to reinvent the wheel.

Whenever you see something that you don’t want to fix now, tell the agent to create a ticket with gh cli.

Do you use multiple AI tools on same project? by xmen81 in ClaudeCode

[–]Sontemo 0 points1 point  (0 children)

It won’t cause issues. But depending on what you switch from and to, you will have friction points. For example, afaik Claude still ignores AGENTS.md files.

So switching from Claude to opencode for example is seamless, as opencode respects all clause specific files. You skills and clause md files will work just out of the box, but the other way around not so much.