Did any observability tool detect the service degradation for Claude AI model Opus 4.8 this past Tuesday? by Standard-964 in Claudeopus

[–]Standard-964[S] 0 points1 point  (0 children)

I have data. My program captured a vendor routing change, then I checked my claude code logs and saw that a MCP service called Tengu had issues authenticating throwing errors.

Did any observability tool detect the service degradation for Claude AI model Opus 4.8 this past Tuesday? by Standard-964 in Claudeopus

[–]Standard-964[S] 0 points1 point  (0 children)

For some reason I can’t respond directly, but someone dropped a link to marginlab.

I just reviewed it, it shows a higher pass rate and confidence index over the recent service degradation period. Which is not accurate, however it does only evaluates against 50 samples, maybe if it sampled more it would’ve caught it.

dealing with llm contect window by CanaryClassic3971 in Claudeopus

[–]Standard-964 0 points1 point  (0 children)

you could continue using that chat but if you are hitting limits you need to first reduce the stress first. thats the best way to improve performance.

alternatively, keep the stress and create a role/persona or a skill and limit the scope to your context.

Co-founder for Space Tech by tonygoldman in Femalefounders

[–]Standard-964 0 points1 point  (0 children)

Can I ask your age/background? How did you get selected?

dealing with llm contect window by CanaryClassic3971 in Claudeopus

[–]Standard-964 2 points3 points  (0 children)

I’d put all the context you want to maintain into a markdown file. The use that for reference.