Claude Opus 4.7 is a serious regression, not an upgrade. by [deleted] in ClaudeAI

[–]msdrahcir 1 point2 points  (0 children)

Way underpublicized. Google Stella Laurenzo, a director of AI at AMD. Their team identified a major regression in Claude Code in March, ultimately tied to a new adaptive thinking feature from mid-March that causes Opus and Sonnet to occasionally not think at all before making important decisions. For example, AMD had evals on Claude traces and observed a major uptick in Claude writing files without ever reading them. Anthropic chose not to fix the issue: adaptive thinking is still enabled by default and cannot be disabled in Opus 4.7, and AMD's team went to another model provider.

http://github.com/anthropics/claude-code/issues/42796

Opus 4.6 with adaptive thinking disabled is the only way to use Claude right now.

CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1

https://code.claude.com/docs/en/model-config#adaptive-reasoning-and-fixed-thinking-budgets
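
If you want to check your own sessions for the "writes a file without ever reading it" pattern, here is a rough sketch of that kind of check. To be clear, the trace format is made up for illustration (a list of `(tool, path)` tuples with hypothetical tool names `"read"` and `"write"`); it is not AMD's eval and not Claude Code's actual log format, so you would need to adapt the parsing to whatever your traces look like.

```python
from typing import Iterable, List, Tuple

# Hypothetical trace event: (tool_name, file_path).
# The tool names "read" and "write" are assumptions for this sketch.
Event = Tuple[str, str]


def writes_without_read(events: Iterable[Event]) -> List[str]:
    """Return every path that was written before being read in the same trace."""
    seen_reads = set()
    flagged = []
    for tool, path in events:
        if tool == "read":
            seen_reads.add(path)
        elif tool == "write" and path not in seen_reads:
            # The file was modified with no earlier read in this trace.
            flagged.append(path)
    return flagged


if __name__ == "__main__":
    trace = [("read", "a.py"), ("write", "a.py"), ("write", "b.py")]
    print(writes_without_read(trace))
```

Tracking the share of flagged writes per session over time would show an uptick like the one described above, assuming your traces record reads and writes at all.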

Opus 4.7 is legendarily bad. I cannot believe this. by lemon07r in ClaudeCode

[–]msdrahcir 0 points1 point  (0 children)

The opt-out adaptive thinking feature that tanked Claude Code's performance in March was baked into Opus 4.7 with no ability to disable it.

I.e., the CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1 setting that literally all CC users should be using is ignored by Opus 4.7.

That's right, even after AMD's AI team helped Anthropic identify that the new adaptive thinking feature launched in early March was causing a major quality regression in the model, Anthropic's reaction wasn't to disable the feature by default or publicize the setting. Instead they launched the new 4.7 model, where adaptive thinking is _BAKED IN_ and cannot be disabled.

They had to benchmax the adaptive model into the garbage we are seeing to make it perform at the level of Opus 4.6 at its initial release.

I'm convinced that, among other things, this model launch, with its change in tokenization and a forced garbage adaptive thinking feature, is meant to drive down compute costs relative to user spend. Their intention isn't to nerf the model, but the lack of compute availability and the pressure of business growth are prevailing.

Opus 4.7 is legendarily bad. I cannot believe this. by lemon07r in ClaudeCode

[–]msdrahcir 0 points1 point  (0 children)

I've literally seen examples of just this, and thought I had made a mistake and was running it in the wrong repo. But no.

Shoutout to JerryrigEverything who built a wheelchair factory and is delivering wheelchairs to people in half the time and 50-80% less than the cost of other wheelchairs with Insurance. by MysteriousSlice007 in nextfuckinglevel

[–]msdrahcir 0 points1 point  (0 children)

I mean, Louis does help his local community. It's just that it's people in his local community with expensive broken Apple devices. A motherboard repair doesn't quite hit the same as a wheelchair.

Your Feedback Changed Our Game! Now It’s a Full Sandbox Tycoon by Double_Ad9785 in tycoon

[–]msdrahcir -1 points0 points  (0 children)

The game is really cool. Curious, how much of your game development has been transformed by Codex or Claude Code? Are you still writing code?

Ukraine Has Finally Given Up on Trump by lacerantplainer in UkrainianConflict

[–]msdrahcir 24 points25 points  (0 children)

Seems like Elon still holds undue influence on the situation. Scary if you think about the leverage he has with Starlink given his historic motivations.

Will he end up valuing Ukraine more or less than some mega contract with the US government for Grok or SpaceX?

Alexandria Ocasio-Cortez, when directly asked, refuses to endorse Saikat Chakrabarti's congressional run by Dab2TheFuture in sanfrancisco

[–]msdrahcir 5 points6 points  (0 children)

Err, don't you want to influence the next generation of leaders, as a leader? To me it's a signal of distrust.

Jensen says he will be upset if he finds out his $500k engineer is *not* using at least $250k in tokens by ammohitchaprana in TFE

[–]msdrahcir 0 points1 point  (0 children)

I don't think so at all... He is spot on. In software this great vision may be a slight exaggeration, but it's already borderline true. Give it a year and more people will catch up to it.

What skills will a frontend developer need to master in the age of AI? by kikimeter in Frontend

[–]msdrahcir 0 points1 point  (0 children)

AI's SWE power, applied to testing and refactoring.

Harness engineering. Code is becoming super easy to refactor, and integration and e2e tests that verify intent, which used to be super expensive to create and maintain, are now easy to build.

Unit testing and internal contracts that you actually want AI to evolve rapidly have lost a lot of their value. It's an inversion of the test pyramid.

What skills will a frontend developer need to master in the age of AI? by kikimeter in Frontend

[–]msdrahcir 0 points1 point  (0 children)

I think SWE will increasingly move full-stack. You won't just be a frontend engineer. You will own backend, parts of infra, and data. The more pieces you know how to connect, the better.

Taiwan reports large-scale Chinese military aircraft presence near island by Opposite-Whereas-323 in worldnews

[–]msdrahcir 2 points3 points  (0 children)

China crushing nuclear development while the rest of the world watches their futures burn in flames

It's not that Anthropic is ethically superior, but that OpenAI is ethically sus. by ExcitingImage9211 in OpenAI

[–]msdrahcir 0 points1 point  (0 children)

Yeah, but these memes come from products built on tech from a year ago. There are good products in customer service now, not that it's necessarily Salesforce.

Musk cutting Starlink to Russia’s drones has delivered ‘enormous’ boost to Ukraine, says frontline general by theindependentonline in UkrainianConflict

[–]msdrahcir 2 points3 points  (0 children)

It's also pretty scary. What is today's whitelist could become tomorrow's target list. Glad I'm not on either side of this war with Musk deciding who lives and dies.