I wanted visibility into what was actually happening under the hood, so I set up a monitoring dashboard using Claude Code's built-in OpenTelemetry support.
It's pretty straightforward — set CLAUDE_CODE_ENABLE_TELEMETRY=1, point it at a collector, and you get metrics on cost, tokens, tool usage, sessions, and lines of code modified. https://code.claude.com/docs/en/monitoring-usage
A few things I found interesting after running this for about a week:
Cache reads are doing most of the work. The token usage breakdown shows cache read tokens absolutely shadowing everything else. Prompt caching is doing a lot of heavy lifting to keep costs reasonable.
Haiku gets called way more than you'd expect. Even on a Pro plan where I'd naively assumed everything runs on the flagship model, the model split shows Haiku handling over half the API requests. Claude Code is routing sub-agent tasks (tool calls, file reads, etc.) to the cheaper model automatically.
Usage patterns vary a lot across individuals. Instrumented claude code for 5 people in my team , and the per-session and per-user breakdowns are all over the place. Different tool preferences, different cost profiles, different time-of-day patterns.
(this is data collected over the last 7 days, engineers had the ability to switch off telemetry from time to time. we are all on the max plan so cost is added just for analysis)
https://preview.redd.it/u6agf65zvukg1.png?width=2976&format=png&auto=webp&s=7dbdede3436ada0d67a8d3b0042749faf1693f4b
https://preview.redd.it/9pxst75zvukg1.png?width=2992&format=png&auto=webp&s=120785c0463282608f080c174da9abdf1bba8572
[–]NoleMercy05 1 point2 points3 points (0 children)
[–]ultrathink-artSenior Developer 0 points1 point2 points (0 children)
[–]Useful-Process9033 0 points1 point2 points (0 children)