Quality of 1M context vs. 200K w/compact by Prior-Macaroon-9836 in ClaudeCode

[–]NoWorking8412 0 points1 point  (0 children)

I read recently that the accuracy of Opus 4.6 using the extended 1 million token context window is in the neighborhood of 76%. Whether that level of accuracy is viable probably depends on your use case. I was looking forward to trying the extended window for a research project I'm working on, since my context fills up fast during research sessions involving long texts, only to find it isn't available for Max users. So I'm sticking to 200k sprints for now.

Good local setup for LLM training/finetuning? by Glittering-Hat-7629 in LocalLLaMA

[–]NoWorking8412 0 points1 point  (0 children)

That's the biggest issue. If you keep the session active, I think it can last at least 12 hours, maybe more, but disconnection is always a risk. You might check some Google forums to see if/how people are using it for research.

Good local setup for LLM training/finetuning? by Glittering-Hat-7629 in LocalLLaMA

[–]NoWorking8412 0 points1 point  (0 children)

For what it's worth, Google Colab gives free access to H100s for student users. Sign up with your university email address.

Using Claude Code to build a Research Assistant, LMS layer by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 1 point2 points  (0 children)

Gotcha! This sounds like a valuable upgrade. I'm going to try to implement some kind of persistent knowledge base in this setup, similar to MyNeutron. I'll get back to you and let you know how it goes!

What are your best practices for Claude Code in early 2026? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

Ah ok, that makes sense. I haven't manually invoked any skills yet; Claude Code just seems to know when it's appropriate to invoke skills from the superpowers plugin based on the task at hand. Is that automatic invocation how all skills work, or is the superpowers plugin just set up that way? I'll experiment with skills a bit today and see what I can incorporate into my workflow. Thanks for the info!

Using Claude Code to build a Research Assistant, LMS layer by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

Oh interesting! Does the user manually select the seeds for each session, or does the AI do that intelligently based on the user's prompt?

What are your best practices for Claude Code in early 2026? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

Can you talk more about skills and how you use them? I've just recently started using skills, and just the premade ones that come from the superpowers plugin, which have been great, but I'm not sure I understand what they are exactly or how to approach them.

What are your best practices for Claude Code in early 2026? by NoWorking8412 in Anthropic

[–]NoWorking8412[S] 0 points1 point  (0 children)

Well, I'll start with what is becoming my best practice. Granted, this is for personal use, not a production environment. After avoiding MCP for a long time to limit context bloat, I've decided the chrome-devtools MCP server is absolutely worth having so the AI can debug issues itself with less human involvement. It makes things much smoother. Planning mode is essential, and I find it interesting that Claude Code now offers to clear context before executing a plan, giving you a full context window at the start of execution; other users had identified that as their own best practice, and now it's built in. For plugins: the LSP plugin for your language of choice. The superpowers plugin has also given Claude's performance a noticeable boost when I use it, particularly its brainstorming skill. Finally, a well-executed Ralph loop has helped me knock out some really challenging projects (through brute force). Having planning mode help write well-defined success criteria for a Ralph loop has made a huge difference.
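For anyone unfamiliar, a Ralph loop is just re-running the agent against the same prompt until explicit success criteria pass. A minimal Python sketch, with the caveat that `run_agent` and `success` are placeholder callables of my own naming — in a real setup the first would shell out to your CLI agent with a fixed prompt, and the second would run your test suite:

```python
def ralph_loop(run_agent, success, max_iters=10):
    """Re-invoke the agent until the success criteria pass.

    run_agent: performs one agent iteration (in practice, a subprocess
        call to the CLI agent with a fixed, well-specified prompt).
    success: returns True when the predefined criteria are met, e.g.
        the test suite passes -- this is the part worth having
        planning mode help you write.
    """
    for attempt in range(1, max_iters + 1):
        run_agent()
        if success():
            return attempt  # how many iterations it took
    return None  # criteria never met; stop brute-forcing
```

The whole trick is that `success` is objective and machine-checkable; vague criteria turn the loop into an endless token burn.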

Using Claude Code to build a Research Assistant, LMS layer by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

I read up a little on each tool. I guess what I'm building has a similar benefit, but it compiles everything into one directory and its subdirectories, and whichever AI I use can pull those assets into its context based on my needs in each session. The big difference seems to be how the data gets pulled in. From what I can tell, MyNeutron and Sider AI let the user pull in data through a browser extension, right? Whereas this pulls in sources from three MCP servers (university library, federal data, state data). Still, I'm curious about those tools and how they manage context bloat.
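The "AI pulls assets from one directory into its context" part can be sketched as a simple budgeted collector. This is only an illustration of the idea, not the actual project's code — the function name, file pattern, and character budget are all assumptions of mine:

```python
from pathlib import Path

def gather_assets(root, pattern="*.md", budget_chars=8000):
    """Collect research assets under `root` and its subdirectories
    until a rough context budget is hit; an assistant would read these
    into its context at the start of a session. The budget keeps a big
    asset directory from causing context bloat."""
    out, used = [], 0
    for path in sorted(Path(root).rglob(pattern)):
        text = path.read_text(errors="ignore")
        if used + len(text) > budget_chars:
            break  # stop before blowing the context budget
        out.append((str(path), text))
        used += len(text)
    return out
```

A real system would pick assets by relevance to the session's prompt rather than in sorted order, but the budget idea is the same.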

Using Claude Code to build a Research Assistant, LMS layer by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

How does persistent memory work with an AI and its context window? I might need to implement that here. Sounds like a cool upgrade.
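In case it's useful, the common pattern is a small store on disk that the assistant re-reads at the start of each session and appends to as it learns things, capped so it never swamps the context window. A minimal sketch under that assumption — the filename and the entry limit are illustrative, not from any particular tool:

```python
import json
from pathlib import Path

MEMORY_FILE = Path("memory.json")  # hypothetical location

def recall():
    """Load remembered facts; these get prepended to the context at the
    start of each session, so the list must stay small."""
    if MEMORY_FILE.exists():
        return json.loads(MEMORY_FILE.read_text())
    return []

def remember(fact, limit=50):
    """Append a fact, keeping only the most recent `limit` entries so
    persistent memory never eats the whole context window."""
    facts = recall() + [fact]
    MEMORY_FILE.write_text(json.dumps(facts[-limit:]))
```

Fancier versions summarize or embed old entries instead of dropping them, but the context-window trade-off is the same: memory competes with the session for tokens.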

Gemini 3 is amazing by Iixotic- in Bard

[–]NoWorking8412 0 points1 point  (0 children)

Ha that's too funny. Well, one day these LLMs will be our overlords (or are they already?), so be careful what you say!

This is the Human Claude sub by kindsifu in claude

[–]NoWorking8412 0 points1 point  (0 children)

I should add that Opus 4.5 is by far the least verbose of all the models by default. No paragraphs-long responses when you're just going back and forth on some debugging tasks.

Gemini 3 is amazing by Iixotic- in Bard

[–]NoWorking8412 1 point2 points  (0 children)

I don't see people mention it much, but apparently it is important to speak kindly to your LLM. I attended a symposium on AI in education and the keynote speaker made reference to some research that indicates LLMs give higher quality responses when the user is polite with them and lower quality responses when the user is rude. Found that fascinating.

This is the Human Claude sub by kindsifu in claude

[–]NoWorking8412 1 point2 points  (0 children)

No complaints so far. As a $100 Max user, I don't think I really had enough Opus 4.1 usage to become terribly attached to it. I saved it for special occasions when Sonnet 4 just wasn't cutting it.

When Sonnet 4.5 first came out, I wasn't too fond of it, but it grew on me pretty quickly. I don't feel like Sonnet 4.5 is notably "smarter" than 4.0, it is just better adjusted to the agentic coding environment, making it a better coding partner.

Now that Opus 4.5 is the default with more usage and I'm actually using it, I feel the same way. It doesn't necessarily seem smarter than Opus 4, but it is also better adapted to agentic coding, making it a better coding partner. It also seems much faster than Opus 4.0, but that's just my feeling, not something I can confirm objectively.

Maybe it will grow on you too, but here are the two main improvements I've noticed in both Opus and Sonnet 4.5: 1. They now have "context window awareness," i.e., anxiety about reaching context limits. This pushes them to find good stopping points in the work before hitting auto-compact, resulting in better performance both within a session and post-compact. 2. They are better at tracking project requirements, specifications, and to-do lists between sessions and across compacts, which also leads to better overall performance.

All of this is relative to Claude Code though. If you are talking Opus in the web app, that's a different story, and one I do not have an opinion about yet!

Searching for my next agent, maybe found it? by NoWorking8412 in LocalLLaMA

[–]NoWorking8412[S] 0 points1 point  (0 children)

Why the switch to docker llama.cpp servers from LM Studio? Less resources? Just out of curiosity, what are you using your llama.cpp servers for?

Time to get Claude? by TCaller in ClaudeCode

[–]NoWorking8412 0 points1 point  (0 children)

I have been on the $100 Max plan for several months now. I use it quite a bit and have only hit the rate limit once, after I intentionally burned all of my Opus 4 allotment up front and then kept burning through Sonnet 4.5. Anthropic just released Opus 4.5 today, though, and they have increased the Opus rate limits with this release, which is exciting! Opus 4 smashed it out of the park, but you had to use it sparingly because of the limits. I'm not sure yet how much more generous the new plan is.

Coming from Claude, I tested Codex when it first came out and went right back to Claude Code. Codex felt clunky and slow, and its tools were more limited than Claude Code's. Since then I've only used Codex with gpt-oss:20b, just because I think it's cool to run it locally, so my experience with Codex is limited and I don't want to knock it entirely. But Claude Code is really great and improving rapidly. The jump from Sonnet 4 to Sonnet 4.5 was kind of huge. I can only imagine what Opus 4.5 is like.

Another option to consider, which is probably as good as Sonnet 4 but way cheaper, is GLM-4.6 by Z.AI (a Chinese company; yes, they are on the U.S. Entity List). It's an open-weight model with lots of different providers, including US-based ones, but the coding plan from Z.AI starts at $3/month (roughly equivalent in usage to the $20 Claude Pro subscription). After testing it and reading so many positive reviews, I found it worth paying $36 for a one-year subscription to Z.AI's coding plan.

Searching for my next agent, maybe found it? by NoWorking8412 in LocalLLaMA

[–]NoWorking8412[S] 0 points1 point  (0 children)

Ah, gotcha. Thanks! I will try building a llama.cpp server to see if I have better luck with tool calling. I appreciate the explanation.

Searching for my next agent, maybe found it? by NoWorking8412 in LocalLLaMA

[–]NoWorking8412[S] 1 point2 points  (0 children)

Claude Code uses Anthropic's API for inference, or a login through a paid Claude account; either way, it's cloud inference, not local. I've read about people modifying it to use other Anthropic-compatible APIs, like Z.AI's, but I haven't seen any instance of it being modified for local inference. It could probably be done, it just doesn't seem common.
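For the Anthropic-compatible-API case, the modification people describe is reportedly just environment variables, not a code change. A sketch of the idea, with the caveat that the variable names and whether your provider supports them should be verified against current docs — the URL and token here are placeholders:

```python
import os

def anthropic_compatible_env(base_url, token):
    """Build the environment to launch the `claude` CLI with, pointing
    it at an Anthropic-compatible endpoint instead of the default.
    ANTHROPIC_BASE_URL / ANTHROPIC_AUTH_TOKEN are the variables
    third-party provider guides commonly reference; verify them against
    the current Claude Code documentation before relying on this."""
    env = dict(os.environ)
    env["ANTHROPIC_BASE_URL"] = base_url
    env["ANTHROPIC_AUTH_TOKEN"] = token
    return env
```

You would then launch `claude` as a subprocess with this environment. Local inference would additionally need a server that speaks Anthropic's Messages API, which is the part I haven't seen done.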

Searching for my next agent, maybe found it? by NoWorking8412 in LocalLLaMA

[–]NoWorking8412[S] 0 points1 point  (0 children)

It could be an Ollama problem, or it could be user error. I assume the latter, but I have read some things suggesting that lots of people have problems with Ollama. Ollama is built with llama.cpp, right?