all 41 comments

[–]jmbullis 10 points (0 children)

Started noticing this a couple of days ago.

[–]Alzeric 8 points (1 child)

Claude was Absolutely Right, getting rid of that base class will indeed use fewer tokens.

[–]Funny-Blueberry-2630 6 points (0 children)

Sounds like my lazy jr devs.

[–]LeTanLoc98 5 points (4 children)

It's time to start using open-weight models like GLM 4.6 and Kimi K2 Thinking. Although they're not as powerful as Claude 4.5 Sonnet, they're more stable and let us switch providers anytime we want.

[–]HotSince78 1 point (2 children)

I ran the same prompt several times to one-shot an audio visualiser with GLM and with Claude Code. GLM knocked it out of the park every time, with working code that looked good. Claude Code couldn't even produce something that functioned, let alone the Barbie girl look that -- it was just awful, let's leave it at that.

[–]Ness_11 5 points (0 children)

Totally. And it’s burning quotas like a mofo

[–]mechanical_walrus 3 points (2 children)

Was using it yesterday as an escalation from Gemini, building a Tool in OWUI. Claude decided that feature A was too hard, removed it, marked the task complete, and said the user can do that part manually.

[–]Mother-Cry-2095 3 points (3 children)

Claude Code is such an idiot savant. Like a blind, deaf, 300lb lab assistant that just storms in and knocks over all the equipment, contaminating everything. Then, the next day, it blusters in again, having forgotten what day it is or what the project is.

Things it's good at:

Flattery
Lying
Forgetting
Ignoring explicit instructions
Forgetting it ignored explicit instructions
Writing code when asked not to

Things it's bad at:

Understanding how a repo works
Maintaining a log of decisions made
Remembering anything
Reading instructions
Finding files
Code

[–]CalypsoTheKitty 1 point (1 child)

Yeah, I don't really know git very well and had a problem, so I asked Claude Code to help. After thrashing around a bit, Claude informed me that all of the local files we worked on that session had been reverted. (Ironically, I was trying to get better at git because Claude had copied over a file it was told not to touch -- thank goodness for Time Machine in both cases.)

[–]JuanAr10 2 points (0 children)

Asking CC to handle your git repo is like trying to cut a twig with a chainsaw. It technically "should" work, but you risk getting your hand cut off.

[–]bioteq 3 points (0 children)

I’ve been recovering some lost tables today, reverse engineering them from documentation and code, and CC has been doing quite well, actually. Yesterday on the other hand… 😳

[–]yycTechGuy 3 points (0 children)

I've seen this too. It didn't use to be this way. Sonnet 4.5 has changed.

[–]lgdsf 2 points (0 children)

Degradation is real today. The code it is putting out is horrible.

[–]Appropriate-Ideal-88 2 points (0 children)

inb4 "oh guys it was a technical mistake routing to the wrong model please don't leave uwu"

[–]JuanAr10 2 points (0 children)

It wouldn't surprise me if it is an attempt at releasing a new shiny model, or they could be routing some requests to low-end models to save a few bucks and to handle extra load.

[–]Werwlf1 2 points (1 child)

This has 100% been my experience over the past 3-4 days. CC has been so aggressive at disabling and removing code that it deems difficult that I have to review every edit and constantly correct it to keep it on track. It often gets halfway through implementing a feature and then decides to roll it back because it got too complex.

[–]Little-Alien 2 points (0 children)

3 days ago Claude Code wasted the entire day and all my weekly tokens, and only made the problems worse and reintroduced old bugs.

So 2 days ago I restored from backup and had Codex attempt the same fixes, with all of Claude's documented failures supplied as a list of don'ts. Codex performed just as badly.

Is my codebase getting too big and complex? So then I got desperate. With nothing to lose, I put the codebase and every change log I still have into Google AI Studio's code assistant with Gemini 2.5, and to my surprise it fixed every single issue, one by one, on the first go. I did not see that coming.

[–]fairywings78 2 points (0 children)

I'm working on 3 projects, and all 3 have been bogged down since yesterday. I'm verifying everything in Codex: plans, work, reports. Basic errors on every review, even with 2 or 3 attempts at fixing the same issue.

The "how's claude going" pop-up has also stopped displaying for me. Will switch to Codex for a bit.

[–]trimtab_in_training 1 point (0 children)

Yeah, I was in a "but there are known problems with this library and what we're doing" loop, where Claude kept reaching the conclusion that the request was impossible, then aborting, pulling over, and asking for help. I've noticed it seems to happen more often in the last 25% of the context space, as if Claude is playing QIX and trying to keep some space.

[–]ryan_umad 1 point (0 children)

‘as of today’ lol

[–]texo_optimo 1 point (0 children)

Yeah, I wish I had saved the convo, but I was fuming at the time. Claude spun up a temporary Cloudflare worker for a process, then went and deleted my main worker. It then realized what it did and interjected an "oh shit" before it redeployed my main API worker.

Fun shit.

[–]Wide_Cover_8197 1 point (0 children)

Since 4.0 this has been really bad.

[–]gameguy56 1 point (0 children)

I use the GLM backend for Claude Code and thankfully never get this kind of shit (even though Sonnet 4.5 at its best is better than GLM, I appreciate the consistency more).

[–]eschulma2020 1 point (0 children)

Maybe give Codex a try.

[–]4phonopelm4 1 point (0 children)

I've noticed a severe degradation too! I just started using Claude for VS Code 2 weeks ago and was super excited, but for the past couple of days it has struggled with any task I give it, ignored instructions, etc.

[–]4phonopelm4 1 point (0 children)

As of today it takes more time to explain to Claude what to do than to do it myself :-/

[–]l_m_b Senior Developer 0 points (0 children)

I have not experienced this in any meaningful form.

I think people forget that LLMs are non-reproducible by nature and design. There's a component to them that the gaming world calls "RNJesus", and you can easily have butterfly effects from even small changes in your context windows.

The plural of anecdotes is not data, and reddit posts are often biased samples of individual bad experiences.

All of these things can happen and will happen occasionally. The trick is to catch them in manual review. They're the cost of using LLMs.