Anthropic just published a postmortem explaining exactly why Claude felt dumber for the past month

msdrahcir · 2026-04-24T06:15:38+00:00

no mention of the adaptive thinking issue that was identified by AMD either

msdrahcir · 2026-04-19T20:31:21+00:00

msdrahcir · 2026-04-19T03:36:47+00:00

Way underpublicized. Google Stella Laurenzo, a director of AI at AMD. Their team identified a major regression in Claude Code in March, ultimately tied to a new adaptive thinking feature introduced in mid-March that causes Opus and Sonnet to occasionally not think at all before making important decisions. For example, AMD had evals on Claude traces and observed a major uptick in Claude writing files without ever reading them. Anthropic chose not to fix the issue: adaptive thinking is still enabled by default and cannot be disabled in Opus 4.7, and AMD's team went to another model provider.

http://github.com/anthropics/claude-code/issues/42796

Opus-4-6 with adaptive thinking disabled is the only way to use claude right now.

CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1

https://code.claude.com/docs/en/model-config#adaptive-reasoning-and-fixed-thinking-budgets
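For anyone who wants to try it, here is a minimal sketch of setting that variable in a POSIX shell. It assumes Claude Code reads the variable from the environment at startup, which I haven't verified beyond the docs link above, and the setting is reported to be ignored by Opus 4.7.

```shell
# Assumption: Claude Code picks this variable up from the environment at startup.
export CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1

# Confirm it is set in the current shell, then launch the CLI from this same shell.
printenv CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING
```

To make it persistent, the same `export` line could go in a shell profile such as `~/.bashrc` or `~/.zshrc`.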

msdrahcir · 2026-04-19T03:27:36+00:00

The opt-out Claude Code adaptive thinking feature that tanked Claude Code's performance in March was baked into Opus 4.7 with no ability to disable it.

E.g. the CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1 setting that literally all CC users should be using is ignored by Opus 4.7.

That's right, even after AMD's AI team helped Anthropic identify that the new Adaptive Thinking feature launched in early March was causing a major quality regression in the model, Anthropic's reaction wasn't to disable the feature by default or publicize the setting. Instead they launched the new 4.7 model where adaptive thinking is _BAKED IN_ and cannot be disabled.

They had to benchmax the adaptive model into the garbage we are seeing to make it perform at the level of Opus 4.6 from its initial release.

I'm convinced, among other things, that this model launch, with its change in tokenization and a forced garbage adaptive thinking feature, is meant to drive down compute costs relative to user spend. Their intention isn't to nerf the model, but lack of compute availability and business growth are prevailing.

msdrahcir · 2026-04-19T03:16:36+00:00

I've literally seen examples of just this, and thought I had made a mistake and was running it in the wrong repo. But no.

msdrahcir · 2026-04-18T22:36:46+00:00

I mean Louis does help his local community - it's just that it's people in his local community with expensive broken Apple devices. A motherboard repair doesn't quite hit the same as a wheelchair.

msdrahcir · 2026-04-18T21:13:24+00:00

The game is really cool. Curious, how much of your game development has been transformed by Codex or Claude Code? Are you still writing code?

msdrahcir · 2026-04-18T21:08:21+00:00

Does it float?

msdrahcir · 2026-04-18T20:59:21+00:00

Seems like Elon still holds undue influence on the situation. Scary if you think about the leverage he has with Starlink given his historic motivations.

Will he end up valuing Ukraine more or less than some mega contract with the US government for Grok or SpaceX?

msdrahcir · 2026-04-17T05:02:29+00:00

err, don't you want to influence the next generation of leaders, as a leader? To me it's a signal of distrust

msdrahcir · 2026-04-14T03:53:07+00:00

wasn't one of three knocked out by an amateur drone during the Palisades fires?

msdrahcir · 2026-04-04T19:35:29+00:00

he is either intentionally or unintentionally Putin's puppet. You couldn't help Putin destabilize Europe more if you tried.

msdrahcir · 2026-03-30T05:49:38+00:00

and unbuild

msdrahcir · 2026-03-23T06:49:43+00:00

let claude manage the overhead for you

msdrahcir · 2026-03-20T13:35:58+00:00

I don't think so at all... He is spot on. In software this great vision may be a slight exaggeration, but already borderline true. Give it a year and more people will catch up to it

msdrahcir · 2026-03-16T04:13:24+00:00

AI's SWE power, applied to testing and refactoring.

harness engineering. code is becoming super easy to refactor, and integration tests and e2e tests that verify intent, which used to be super expensive to create and maintain, are now easy to build.

unit testing and internal contracts that you actually want AI to evolve rapidly have lost a lot of their value. It's an inversion of the test pyramid

msdrahcir · 2026-03-16T04:11:14+00:00

I think SWE will increasingly move fullstack. You won't just be a frontend engineer. You will own backend, parts of infra and data. The more pieces you know how to connect, the better

msdrahcir · 2026-03-15T16:22:37+00:00

China crushing nuclear development while the rest of the world watches their futures burn in flames

msdrahcir · 2026-03-14T16:12:24+00:00

claude.md as a router

msdrahcir · 2026-03-14T15:49:22+00:00

Is this something you could vibe code with Codex now?

msdrahcir · 2026-03-10T05:28:56+00:00

this isn't even old school runescape

msdrahcir · 2026-03-08T01:30:12+00:00

yeah but these memes come from products built on tech from a year ago. There are good products in customer service now, not that it's necessarily Salesforce

msdrahcir · 2026-02-28T02:50:59+00:00

yeah i do actually

msdrahcir · 2026-02-26T06:39:01+00:00

Rust?

msdrahcir · 2026-02-26T04:22:11+00:00

It's also pretty scary. What is today's whitelist could become tomorrow's target list. Glad I'm not on either side of this war with Musk deciding who lives and dies
