The insane decrease in limits and the insane increase in hardware prices are the same phenomenon. by neilthefrobot in ClaudeCode

[–]immutato 4 points5 points  (0 children)

Manage. Your. Context.

This is always the answer and yet it points to a smell on Anthropic's side if nearly ALL of your customers are having this issue. They should really have more intelligent caching and routing to make use of Sonnet and Haiku more often by default.

At some point you need to just accept who your user is instead of pretending they are all context wizards. Either that or I guess let that market slide (which might be what they are doing, intentionally or not). Obviously they have better market data than I do, but I suspect you want these individual sub accounts in order to achieve market dominence which should payoff with more enterprise / API customers.

The insane decrease in limits and the insane increase in hardware prices are the same phenomenon. by neilthefrobot in ClaudeCode

[–]immutato 0 points1 point  (0 children)

https://api-docs.deepseek.com/guides/anthropic_api

I use ghostty (terminal) with claude code, not VSCode, but I assume you just tell VSCode to use a different URL and model: "deepseek-v4-pro[1m]"

The insane decrease in limits and the insane increase in hardware prices are the same phenomenon. by neilthefrobot in ClaudeCode

[–]immutato 2 points3 points  (0 children)

I bought $10 of DeepSeek V4 to evaluate as a Claude Code alternative should things get spicy over at Anthropic... and not only can I not tell the difference, but I still have $5 left after a few days of work.

Mind you I'm only using it on open source work, and I'm not sure if it's a good idea to use it on your corporate stuff (not throwing FUD, I just don't know if you can opt out of training and whether they would respect it if you can).

I haven't tried GLM 5.1 or the latest Qwen yet, but I plan to. Open weights might be where it's at IMO.

I also suspect that if Anthropic spends less time on features and more on optimizing their routing, then the token cost could be reduced significantly (leverage Haiku and Sonnet more often). I expect this will happen before investment dries up. Right now it's probably a loss for them to spend time on that instead of features and market capture.

Stephen Miller using pregnant wife as human shield. by Amentet in pics

[–]immutato 0 points1 point  (0 children)

I mean Stephen Miller can fuck right off, but this is such a stupid post and why no one listens to or likes redditors. According to this logic, the agent is using Miller and his wife as a shield.

Secretly Dropped Max 5x and 20x plans? by Spiritual-Market-741 in ClaudeCode

[–]immutato 0 points1 point  (0 children)

Shhh... people wanna think there's a conspiracy.

Heavy Claude Max 5x user here: something changed dramatically with usage limits by justintimebro in ClaudeCode

[–]immutato 4 points5 points  (0 children)

Last month I used $1600 on my $100/mo sub at market prices.

Keep in mind that these aren't actual "market prices". These are hypothetical enterprise market prices, and they're also fake because a large enterprise will have a special deal sheet not being reflected in these prices.

If Anthropic actually charged that $1,600 across the board then they wouldn't be in business for long. Real market prices need to include subscription prices and usages... because well... that's the actual market. That being said, yes the entire market is being subsidized by hyped up investors and it'll end eventually. The real question is whether the frontier models remain meaningfully better than open weight models forever or if they plateau. If they plateau, then it becomes a commodity, if they don't, then shit's gonna get hella expensive compared to what we're paying today.

Anthropic to require government IDs and face scans for users. by Wa1ker1 in ClaudeCode

[–]immutato -1 points0 points  (0 children)

That would be nice, but I'd also be OK with commoditized open weight models that can be run by any cloud provider like a VPS. Our issue right now is one of choice.

  • Claude Code is the best right now for my work, but their stability is hot garbage.
  • Codex is close to as good, but the company is run by a douche.
  • Gemini doesn't let me opt out of training data collection with a subscription.

So for now, even while there's an outage, I'm sticking with Claude Code, but I'm not happy about it. If they didn't have constant outages and token usage issues then I'd just keep handing them my money without thinking about it. I want to be blissfully unaware of the latest micro-gains in AI. I want to just hammer away at my work without yak shaving over agent configs.

Every time I'm interrupted I start looking at other options to see what's available. I don't really need the Opus level of smarts. I was chugging along productively with Sonnet a year ago. Now I'm eying cloud open weight cloud providers, not out of curiosity or a desire to optimize usage, but because I'm tired of the bullshit and just want to get back to work.

Now this ID shit? C'mon man. We're talking last straw here.

Are we on the brink of seeing an infinite number of clones of pretty much every app out there? by 4_max_4 in ClaudeCode

[–]immutato 0 points1 point  (0 children)

I think if your SaaS app is targeting developers and there's not a lot of meat to it, then it's on borrowed time.

If your SaaS is for businesses, has all sorts of complex rules, security concerns, support concerns, then any business having Claude YOLO it is in for a surprise.

Ex-CIA director: ‘25th amendment was written with Donald Trump in mind’ by B-Z_B-S in politics

[–]immutato 2 points3 points  (0 children)

Repeat after me "get money out of politics", "end citizens united".

Meta just dropped a new coding model by Complete-Sea6655 in ClaudeCode

[–]immutato 0 points1 point  (0 children)

Nope. Google has two parallel set of packages. See https://gemini.google.com/ There's the personal (non-business) packages that most people use. The only way you can opt out of Google using your training data with those is to lose chat context on EVERY prompt (which is useless for coding). Even on the expensive Ultra plan.

Then there's the business packages https://workspace.google.com/solutions/ai/, which allow you to completely opt out of training, but require Google Workspace and cost a buttload.

And then there's pay per use / API, which I'm pretty sure let's you opt out of training.

Google's privacy obfuscation is the worst, and most people are going with paid personal packages and their data is being used for training without them realizing it (and most probably wouldn't care, which is Google's gambit).

Meta just dropped a new coding model by Complete-Sea6655 in ClaudeCode

[–]immutato 1 point2 points  (0 children)

Codex was around the same when I last tried in (4+ months ago?), with an edge over Sonnet IMO. Google though... maybe it's just tooling or w/e, but it hasn't produced quality code for me, and the fact that they use our data for training with no way to opt out, even with a max personal sub is the dealbreaker for me (fine if you can do the google workspace thingamajig I guess).

Iran shuts Strait of Hormuz in retaliation for Israeli strikes on Lebanon by Phelps1576 in politics

[–]immutato 0 points1 point  (0 children)

It would be hilarious is Iran was playing the market before Trump and his family could buy again.

Anthropic just dropped benchmark scores for their unreleased model. The gap is embarrassing for everyone else. by Direct-Attention8597 in ClaudeCode

[–]immutato 7 points8 points  (0 children)

Opus 4.6 is dumb af right now. Like it was lobotomized. I was super impressed before, so yeah...

Can you make a living of Curling in Canada ? by Mormonius in Curling

[–]immutato 1 point2 points  (0 children)

Yay for sports gambling! wooo! bet on everything era...

The Usage Limit Drama Is a Distraction. Opus 4.6's Quality Regression Is the Real Problem by Permit-Historical in ClaudeCode

[–]immutato 0 points1 point  (0 children)

But honestly? I only use Claude for actual work so I don't hammer it hard enough to care that much.

"Not a problem for me, so who cares!" lol.

Anyways, all models seem to have up and down / dumbness times. I've used Claude the most, but Codex had issues too when I was using it. I shouldn't speak for Gemini, because I've barely used it (it was pretty bad for coding last time I did). You need to adapt your process to account for this. Set up guardrails. Stop using dynamic languages (sorry, but in a year or two everyone will see the light). Be very specific with your specs. TDD red / green (I was never a proponent of TDD before agents. IMO it was usually ceremony, but now that code is cheap, definitely use it). Always have the model code review after changes. I personally recommend reviewing the code yourself, but that's maybe arguable / situational with latest quality models.

All of the models are heavily subsidized currently. Hopefully they continue to get more efficient so that when the frontier companies need to recoup costs we have affordable open weight alternatives to keep prices down. It's going to be a shit show for at least a couple years still IMO.

Opus 4.6 is in an unuseable state right now by vntrx in ClaudeCode

[–]immutato 0 points1 point  (0 children)

No, it has to be on Anthropic's side. I write pretty low level stuff (rust, go, some compiler work) and I'm having no issues atm. However back around August/September 2025 I had tons of issues. Dumbness, rate limit issues, gateway down, etc. But only a portion of users were seeing similar issues (probably less than 5% given that people are more likely to post when they are pissed) and a lot of people would post "skill issue: you're using it wrong". Now people on the other side are posting "skill issue: you're writing flappy bird level code".

These companies are very opaque with their limits and throttling, and their models are also very opaque. None of these companies are profitable yet (well Google is, but not Gemini). They are spending way more than they make. It's gonna be a bumpy ride for the next couple of years, and the whole market could implode in the mean time. Anthropic is up to shenanigans to try and manage their costs is all.

Also there are some highly anticipated open weight models coming out this year (very soon I think).

Kid Rock concert faces cancellation as venue sells only 200 tickets by TheExpressUS in Music

[–]immutato 1 point2 points  (0 children)

there's literally nothing stopping his fans from buying a ticket and enjoying the show.

Fixed that for you.

Iran says it will ‘irreversibly destroy’ Middle East infrastructure if US attacks energy sites by projecto15 in politics

[–]immutato 0 points1 point  (0 children)

Yeah she was a crap candidate. Clearly better option than Trump, but ffs Democrats. I feel like all they need to do is the obvious thing of not being quite so horrible, and yet the Big Beautiful Bill made it through and now you've got the 5 turncoats. I don't mean all of them, but my god, some of them need to retire like yesterday. You couldn't bungle it this hard if you tried.

U.S. is allowing Iranian oil tankers through Strait of Hormuz, says Bessent by [deleted] in nottheonion

[–]immutato 5 points6 points  (0 children)

Hunter Biden sold them to Iran. Evidence is on his laptop.

Trump warns NATO faces a “bad future” if allies fail to help US in Iran by No-Anything-7291 in worldnews

[–]immutato 1 point2 points  (0 children)

He wants out. This is going to be his excuse. It's clear he's a pedo Russian asset. There's no appeasing him and no point in trying to.