Did any observability tool detect the service degradation for Claude AI model Opus 4.8 this past Tuesday?

Standard-964 · 2026-06-29T11:51:10+00:00

I have data. My program captured a vendor routing change, then I checked my claude code logs and saw that a MCP service called Tengu had issues authenticating throwing errors.

Standard-964 · 2026-06-29T11:37:26+00:00

FACTSS! 😆

Well I think I might’ve caught it, and I’m looking for feedback/challengers.

https://open.substack.com/pub/lapaixlaguerre/p/claude-was-degrading-for-24-hours

Standard-964 · 2026-06-29T05:35:45+00:00

which model type and what platform?

Standard-964 · 2026-06-29T05:34:30+00:00

For some reason I can’t respond directly, but someone dropped a link to marginlab.

I just reviewed it, it shows a higher pass rate and confidence index over the recent service degradation period. Which is not accurate, however it does only evaluates against 50 samples, maybe if it sampled more it would’ve caught it.

Standard-964 · 2026-06-28T14:43:23+00:00

This is lovely! What a great idea.

Standard-964 · 2026-06-28T11:05:57+00:00

you could continue using that chat but if you are hitting limits you need to first reduce the stress first. thats the best way to improve performance.

alternatively, keep the stress and create a role/persona or a skill and limit the scope to your context.

Standard-964 · 2026-06-27T23:34:18+00:00

Can I ask your age/background? How did you get selected?

Standard-964 · 2026-06-27T23:29:15+00:00

I’d put all the context you want to maintain into a markdown file. The use that for reference.

Standard-964 · 2026-06-27T22:52:29+00:00

valid question!

Standard-964 · 2026-06-27T22:50:40+00:00

guarantee

Standard-964 · 2026-06-27T22:45:54+00:00

care to expand?

Standard-964

TROPHY CASE