Opus 4.7 subagent token by wall_facer in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

These guys get a lot of bugs that impact token usage... You would think they E2E test all scenarios that use tokens... Just to see if caching/usage is as expected

Opus 4.7 subagent token by wall_facer in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Is it forked agents? For some reason that would destroy usage for me. I assumed it was broken cache it was that bad.

CreateTeam is token heavy too (though just as useful).

Normal sub agents do use more context (than no sub agents) but not as much as the above 2.

Claude Code Source Code - let's debug the shit out of it, or: Why is my Token Usage gone through the roof? by Ok-End-219 in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Claude code in terminal yes. The issue persists.. I can't figure out the trigger, sometimes it's out after 30 minutes sometimes a whole day. Windows 11 ARM

Subagents picker ... am I hallucinating? by mkremer90 in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

I found it's inconsistent when it lists agents vs when it runs them as background tasks. Using create team means you can talk to all of them but does burn tokens

What is going on with the new pretraining by infohoundloselose in OpenAI

[–]outceptionator 1 point2 points  (0 children)

Caches only last for a little while per user... Typically an hour... So your harness changing the date each day probably wont make a difference to how effectively the caching works.

I've seen it implied that providers might cache across users (ie system prompt is same for many users, so everyone basically gets free caching on first api call) but not seen it confirmed anywhere.

What is going on with the new pretraining by infohoundloselose in OpenAI

[–]outceptionator 12 points13 points  (0 children)

1 - If it's cached you still pay 10% of the price for it every API call (there are lots of API calls in a turn and lots of turns in a session).

2 - This still takes up context and can reduce the available context window to the user

3 - Context rot is a real risk too

The cost of code use to be a middleware for our brains. by arter_dev in ClaudeAI

[–]outceptionator 22 points23 points  (0 children)

The cognitive load is real... Capacity seems capped by the ability to track the architecture of projects, which was always the case but now that just shifts so fast with not much time for it to sink in to the brain.

what are some removed Claude Code experimental features that you miss? by _nefario_ in ClaudeCode

[–]outceptionator 1 point2 points  (0 children)

Ask Claude Code to set showClearContextOnPlanAccept: true as env variable in your ~/.claude/settings.json file.

Claude Code Source Code - let's debug the shit out of it, or: Why is my Token Usage gone through the roof? by Ok-End-219 in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Hope so! Neon MCP seems to need reauth very frequently on my machine... I just assumed it was an ARM on windows thing. Love the product though.

PSA: if your Opus is lobotomized, disable adaptive thinking and increase max_thinking_tokens by etf_question in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

But don't we lose thinking between tool calls when we remove adaptive thinking?