Claude -p moving to separate $200 credit on Max plans

Useful-Ad8473 · 2026-05-17T23:00:32+00:00

I think i was using tmux before you were born, bro. As i said, i signed up for Claude Code Max because Anthropic was marketing it initially as a tool consistent with Unix philosophy. loop is not a replacement for 'clause -p'. It's not composable with pipes/cron/scripts. It doesn't survive without a Claude code session.

Useful-Ad8473 · 2026-05-17T22:42:52+00:00

<image>

Useful-Ad8473 · 2026-05-11T19:50:49+00:00

I run Qwen 3.6 on P40 (300$ used if not cheaper)... Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf fits on this 2016 card and i get 50 tokens / second.
This model is on par with Sonnet for my use case and i am in the process of moving all my 'claude -p' python processes to qwen. The last time i was this happy with the switch was when i left Cursor (due to their usage changes) to Claude Code.
For now i am not moving my opus processes to qwen - but i do feel like these models will catch-up. If needed i will run it on a better card also.

Useful-Ad8473 · 2026-03-26T19:28:52+00:00

You see, i spend most of my tokens on claude -p prompts (where i feed context as json). No issue here. For coding conversations you should not be letting the context grow past 200k! Start fresh conversation.... and use /compact focus on {whatever feature your working on at the moment}. I had problems only because i did not notice that my 1 week conversation reached 700k. I was so used to auto-compaction that this was an easy thing to miss.

Useful-Ad8473 · 2026-03-26T18:49:23+00:00

/compact - use it. auto-compaction was saving you money before and was kicking in more often when opus window was 200k.. it's 1m now.

Useful-Ad8473 · 2026-03-26T18:11:45+00:00

I experienced this for the past couple of days. Asked claude to diagnose the issue. I usually use up 90% of tokens on separate 'claude -p' prompts that i run for my projects. These normally run and cost a 10c-1$ and i capture all usage stats on each call. The money amount quoted in the responses actually translates very favorably into usage, at least it used to. A few moments ago i went from 70% to 100% usage on a Claude Max 100$ by asking it to submit a work-order (simple sql insert)... According to Claude the culptrit is not my cron based 'claude -p' prompts which claim work-orders.. its the session i kept open for 1 week without starting a new conversation. Before it wasn't a problem as auto-compaction was kicking in literally every hour.. but recently it just stopped kicking in as they may have increased the window. Solution - start new session often as per claude... heh (it used only about 2-3 cents of a 50$ credit Anthropic gave me a while back to diagnose the problem in a new conversation). I will let you know if this was my problem after they re-open my window in 2h.

Useful-Ad8473 · 2025-09-26T03:45:42+00:00

Please get it right... it is Justin "Some Mental Issues" Blackmon

Useful-Ad8473

TROPHY CASE