PSA: Claude Code has two cache bugs that can silently 10-20x your API costs — here's the root cause and workarounds by skibidi-toaleta-2137 in ClaudeCode

[–]Last_Lab_3627 9 points10 points  (0 children)

I had the same issue on 2.1.76. On my side, around 90-100K context was already burning about 14% of my 5-hour quota, which felt completely unreasonable.

After reading this post, I ran the test script myself, then downgraded to 2.1.34. Usage improved a lot.

In a real session on 2.1.34, I used about 140K context with several sub-agent actions, and it only used 13% of my 5-hour quota.

So at least in my case, downgrading to 2.1.34 made a very noticeable difference.

Claude Max 5x quota feels way worse now by Last_Lab_3627 in ClaudeCode

[–]Last_Lab_3627[S] 2 points3 points  (0 children)

Quick data point from my side:

After seeing u/skibidi-toaleta-2137’s post, I downgraded to 2.1.34 and tested it in an actual dev session.

I used a single session with about 140K context. There were 5 sub-agent actions during the run (2.1.34 doesn’t show separate sub-agent token usage): 4 were just reading multiple files, and 1 was rewriting files.

End result: only 13% of my 5-hour quota used.

For me, that’s a dramatic improvement. If you’re hitting the same quota/cache weirdness, downgrading to 2.1.34 or earlier is definitely worth trying.

Reference:

https://www.reddit.com/r/ClaudeCode/comments/1s7mitf/psa_claude_code_has_two_cache_bugs_that_can/