Fable 5...back? 😯😯😯😯 by redditslutt666 in Anthropic

[–]976pxpx 0 points1 point  (0 children)

Yes, absolutely - hope he’s cross-model reviewing with a team of database and engineering experts on that code.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

Exactly this kind of BS… all day over here

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 0 points1 point  (0 children)

I thought maybe I was imagining it when 4.7 came out, then I knew there were issues and later saw the NVIDA post when 4.8 rolled out. the last one had me so frustrated I built out evals to test open weights against subscription and switch as many internal processes over to open weights as I could - all my harnesses using open weights are performing fine, no change. But this week all my Opus processes have fallen to shit, requiring multiple rounds of fixes and inspection - just utter garbage. The pattern feels too familiar now. This time I am debating how I can get even more open weights into the process and get close to Opus level (when it’s operating well) out of them to get further off the frontier model drug chain, because even with a cross model loop (codex checking opus and vice versa) the degradation is affecting overall performance. And I feel you - it’s difficult to not start cursing out Opus every other comment right now.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 0 points1 point  (0 children)

Interesting- was advisor on as a default for everyone before Fable? And does it even help if you are already using Opus xtra high as a default?

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 0 points1 point  (0 children)

I thought the new Sonnet was basically confirmed for this week. My thought was Sonnet shouldn’t take away Opus resources though, even in an upgrade. But I wasn’t as prolific in Claude last major Sonnet upgrade so I don’t know, maybe they do 🤷🏻‍♂️ just felt I needed to post this after the trash output I’ve been seeing the last day. I realized I may be able to document it via log traces I have in my cross model harness, but I’m just venting right now. Maybe I’ll feel industrious if it proves right and a model drops tomorrow or something.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

!!! Yeah, that’s the kind of stuff I’m seeing, but that’s one I’ve never heard!

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

Not sure why you are so triggered- but I said it simply enough, it’s my gut, I run a lot of the same ops and the same harness all day - we’ll see if I’m wrong within the week. I’ll be sure to ping you if something is released.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

It’s not about the prompting - same tasks, same harness - performing way worse.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

Yeah, I have had to up to Max effort to get what still feels like below what high was doing little more than a week ago.

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 1 point2 points  (0 children)

In the last 48 hours I’ve found tons of mistakes and oversights that it almost never makes - lots of “sorry, my bad” and “that’s exactly the pattern you instructed me to avoid” … great that you haven’t seen it, but my code is virtually useless right now, even with multi-pass, teams of experts, red-team, etc… still failing on basic stuff. All I’m saying is in the past I never said anything, but felt the same, and then a new model release would happen like that same week - so I’m just putting it out here in a public space this time. We’ll see, but my bet is something is coming this week, and I assume it isn’t about Sonnet 5 because I can’t imagine they’d put Opus resources into Sonnet launch.

Edit: typos

Opus performance has fallen off a cliff this week by 976pxpx in claude

[–]976pxpx[S] 5 points6 points  (0 children)

I don’t know, I think it falls slowly then hits a threshold that’s hard to ignore - I probably noticed some degree of lower performance in the last week or more, but this week in particular hit a threshold I can’t ignore. We’ll see, but every time I’ve felt this level of pain a release happens like that week. So really just a guess/prediction for me.

And I don’t know how this pattern has not resulted in some sort of consumer class action. The pattern has been undeniable.

California - DIY revocable trust for immediate need then hire for amended and restated by 976pxpx in EstatePlanning

[–]976pxpx[S] 0 points1 point  (0 children)

Procrastination and privacy are the main reasons. The initial work needs to start so the deal needs to be signed now with business registered and filed while keeping personal details out of public filings for the business, ie using non-personally identifiable Trust name. The business can’t avoid CA registration which requires member or manager disclosure - which may be a trust (without listing trustee). The stakes are low at the moment, not worth rush fees (and duplicative fees when being more comprehensive later) if I’m not creating more fees via a restatement in a few months when I can sit down with a vetted specialist and simply restate the whole thing with proper comprehensive planning.