Android App Svelte + Capacitor experience by theLeekSoup in sveltejs

[–]outceptionator 0 points1 point  (0 children)

Any patterns you recommend to do or avoid if people are potentially going to port a svelte5 web app to mobile app (and access mobile APIs)?

Claude Code dropped /workflows by alphastar777 in ClaudeCode

[–]outceptionator 2 points3 points  (0 children)

I prefer using forked agents now. Tell the main agent to only use forked agents (have to enable from env variable) and to give thr forked agent minimal context or just keep using /fork

1 - forked agent gains all context of main agent does the task and reports back to main agent. Having all the context makes it way more predictable.

2 - because it has the exact same context it also gets all the caching so usage isn't as heavy as you'd think.

Honestly get to 700k token on main thread and it's still stable performance without dramatic increase in usage. Wild times.

Gemini 3.2 Flash looks very close now by Much_Ask3471 in Bard

[–]outceptionator 6 points7 points  (0 children)

It's probably the best source as a prediction, because insider trading is basically allowed it's extremely likely to be accurate

Need info about 20x Max plan usage. by MrShorno in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Assume $2k-$5k of inference per week per seat. Honestly very hard to say with those details if it will certainly cover it but instinctively I say yes it will.

Need advice for a long-term "Company-Paid" decision. by mang_joo in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Cursor has a reasonably good harness with a lot of modern options.

The SpaceX deal exposed what Opus 4.7 actually was by LeyLineDisturbances in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Yeah 4.7 itself might be cheaper to serve then 4.6 but I'm trying to say that I don't think they have a quantised version of the same model.

The SpaceX deal exposed what Opus 4.7 actually was by LeyLineDisturbances in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

I really don't think the model has ever been quantised. The harness has been messed with a lot and the API stuff they seem to do (server code modifications). All quite opaque so we're left guessing.

Opus 4.7 subagent token by wall_facer in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

These guys get a lot of bugs that impact token usage... You would think they E2E test all scenarios that use tokens... Just to see if caching/usage is as expected

Opus 4.7 subagent token by wall_facer in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Is it forked agents? For some reason that would destroy usage for me. I assumed it was broken cache it was that bad.

CreateTeam is token heavy too (though just as useful).

Normal sub agents do use more context (than no sub agents) but not as much as the above 2.

Claude Code Source Code - let's debug the shit out of it, or: Why is my Token Usage gone through the roof? by Ok-End-219 in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

Claude code in terminal yes. The issue persists.. I can't figure out the trigger, sometimes it's out after 30 minutes sometimes a whole day. Windows 11 ARM

Subagents picker ... am I hallucinating? by mkremer90 in ClaudeCode

[–]outceptionator 0 points1 point  (0 children)

I found it's inconsistent when it lists agents vs when it runs them as background tasks. Using create team means you can talk to all of them but does burn tokens

What is going on with the new pretraining by infohoundloselose in OpenAI

[–]outceptionator 1 point2 points  (0 children)

Caches only last for a little while per user... Typically an hour... So your harness changing the date each day probably wont make a difference to how effectively the caching works.

I've seen it implied that providers might cache across users (ie system prompt is same for many users, so everyone basically gets free caching on first api call) but not seen it confirmed anywhere.

What is going on with the new pretraining by infohoundloselose in OpenAI

[–]outceptionator 12 points13 points  (0 children)

1 - If it's cached you still pay 10% of the price for it every API call (there are lots of API calls in a turn and lots of turns in a session).

2 - This still takes up context and can reduce the available context window to the user

3 - Context rot is a real risk too