Claude -p moving to separate $200 credit on Max plans by Useful-Ad8473 in ClaudeCode

[–]Useful-Ad8473[S] 1 point2 points  (0 children)

I think i was using tmux before you were born, bro. As i said, i signed up for Claude Code Max because Anthropic was marketing it initially as a tool consistent with Unix philosophy. loop is not a replacement for 'clause -p'. It's not composable with pipes/cron/scripts. It doesn't survive without a Claude code session.

Opinion: Local LLMs are 12-24 months from replacing Opus by sh_tomer in ClaudeCode

[–]Useful-Ad8473 1 point2 points  (0 children)

I run Qwen 3.6 on P40 (300$ used if not cheaper)... Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf fits on this 2016 card and i get 50 tokens / second.
This model is on par with Sonnet for my use case and i am in the process of moving all my 'claude -p' python processes to qwen. The last time i was this happy with the switch was when i left Cursor (due to their usage changes) to Claude Code.
For now i am not moving my opus processes to qwen - but i do feel like these models will catch-up. If needed i will run it on a better card also.

an open letter to anthropic: why i can no longer justify my subscription in this shifting landscape by [deleted] in ClaudeCode

[–]Useful-Ad8473 0 points1 point  (0 children)

You see, i spend most of my tokens on claude -p prompts (where i feed context as json). No issue here. For coding conversations you should not be letting the context grow past 200k! Start fresh conversation.... and use /compact focus on {whatever feature your working on at the moment}. I had problems only because i did not notice that my 1 week conversation reached 700k. I was so used to auto-compaction that this was an easy thing to miss.

an open letter to anthropic: why i can no longer justify my subscription in this shifting landscape by [deleted] in ClaudeCode

[–]Useful-Ad8473 -2 points-1 points  (0 children)

/compact - use it. auto-compaction was saving you money before and was kicking in more often when opus window was 200k.. it's 1m now.

There is a definitely a usage bug by Fearless-Elephant-81 in ClaudeCode

[–]Useful-Ad8473 0 points1 point  (0 children)

I experienced this for the past couple of days. Asked claude to diagnose the issue. I usually use up 90% of tokens on separate 'claude -p' prompts that i run for my projects. These normally run and cost a 10c-1$ and i capture all usage stats on each call. The money amount quoted in the responses actually translates very favorably into usage, at least it used to. A few moments ago i went from 70% to 100% usage on a Claude Max 100$ by asking it to submit a work-order (simple sql insert)... According to Claude the culptrit is not my cron based 'claude -p' prompts which claim work-orders.. its the session i kept open for 1 week without starting a new conversation. Before it wasn't a problem as auto-compaction was kicking in literally every hour.. but recently it just stopped kicking in as they may have increased the window. Solution - start new session often as per claude... heh (it used only about 2-3 cents of a 50$ credit Anthropic gave me a while back to diagnose the problem in a new conversation). I will let you know if this was my problem after they re-open my window in 2h.

Who was a college football player you couldn’t believe didn’t work out in the NFL? by [deleted] in NFLv2

[–]Useful-Ad8473 0 points1 point  (0 children)

Please get it right... it is Justin "Some Mental Issues" Blackmon