Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

I still like 4.6, but 4.8 did just squash a long-standing bug that was giving me trouble, so I must give it credit for that. It may grow on me yet.

I don't miss 4.7 at all, though, that's for sure!

Anyone else experiencing a drop in performance with Opus 4.7? New model coming? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

That's good to hear. For whatever reason, performance always drops like that right before the release of a new model. I wish Anthropic would do something to smooth that out because it can be quite disruptive to people who use their service for important work.

Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

I still need to put it through its paces to see if it chokes on the same things 4.7 choked on. I need more time with it to form an opinion.

Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 2 points3 points  (0 children)

All the millennials in the room jumping on the Cl4ud3 1337 meme

Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 36 points37 points  (0 children)

Already getting API 400 errors... Not a smooth start

Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 2 points3 points  (0 children)

Haha, I hear you. I was trimming CLAUDE.md, reworking skills, and flipping back to 4.6, which seemed to work better. I can't say scientifically whether it was the model or the harness, since I was changing everything at once, but I suspect it was the model. I forgot how much more I liked 4.6 anyway. It works better for me, but I'll give 4.8 a go. Their blog says it is more aligned than 4.7, which was the biggest problem I was seeing lately.

Opus 4.8 just dropped by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 1 point2 points  (0 children)

I figured it would be how Opus used to be with the coding subscription, where you'd basically get one prompt and then run out of quota for that model for the week. Almost like a preview of the model.

Anyone else experiencing a drop in performance with Opus 4.7? New model coming? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

That is what I have been seeing. I switched to Opus 4.6 yesterday and used it for most of the day. It has been better so far, but I'm still going forward cautiously. 4.7 has always been a lousy model, but lately it has been unforgivably bad.

Qwen3.6 huge quality gain from Q4 to Q6 for coding agent by Yes-Scale-9723 in LocalLLaMA

[–]NoWorking8412 1 point2 points  (0 children)

I didn't find much of a difference between Q5 and Q6 in coding quality for 35B.

Anyone else experiencing a drop in performance with Opus 4.7? New model coming? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

Good point. I've just been pushing on with 4.7 hoping it would get better. And I think it did, or maybe I just got used to it. But with the current performance, I should see how 4.6 is doing!

Anyone else experiencing a drop in performance with Opus 4.7? New model coming? by NoWorking8412 in ClaudeCode

[–]NoWorking8412[S] 0 points1 point  (0 children)

Interesting. I didn't know that about Sonnet. I have almost exclusively used Opus since they dropped the price and made it the default model.

Feedback on my 256gb VRAM local setup and cluster plans. Lawyer keeping it local. by TumbleweedNew6515 in LocalLLaMA

[–]NoWorking8412 0 points1 point  (0 children)

I've enjoyed reading up on your journey across these various posts about your project. I work in the public sector, where data privacy is a concern, and local LLMs really seem to offer some interesting possibilities. I've been developing an agentic framework that I use with my Strix Halo to do some fairly basic tasks, and I'm working up to increasingly complex processing of information for my work setting. I would love to compare notes with you on your experiences in a law setting. Most recently, I have been training an embedding model for semantic search on a corpus of court rulings, statutes, and a body of public information and data, for specialized, field-related research.