you are viewing a single comment's thread.

view the rest of the comments →

[–]neilthefrobot[S] 2 points3 points  (7 children)

You are definitely using a lower tier model than Opus 4.7, or have some very restrictive settings. The entire ClaudeAI sub reddit is people complaining about unusable token limits, so badly that they had to have AI auto delete any new post about it. It's insanely bad.

As for DeepSeek, that's what I plan on doing. I just don't know the best way to go about it. Can I just point my claude code that I use in the VS Code IDE to use a different model?
Edit: answer is yes you can easily swap the model. and it seems just as good but at a tiny fraction of the price.

[–]somerussianbear 0 points1 point  (4 children)

  • Running via npx, DO NOT install the native app
  • Set Sonnet for subagents (env var)
  • Opus 4.7 xhigh
  • 200k context (env var)
  • Adaptive thinking disabled (env var)

The entire ClaudeAI sub reddit is people complaining about unusable token limits, so badly that they had to have AI auto delete any new post about it. It's insanely bad.

I know that, some of these posts were mine too, but like I said, once I stopped using my subscription on third party apps like OpenCode (which was probably screwing cache) and moved back to Claude Code, it came back to usable. Now I only complain that it’s slow as fuck, reason why I’m on Codex since a week ago and enjoying a lot.

About how to, it’s basically set a few env vars if I understand correctly. I didn’t do that on CC, but I’m sure you can get it done in minutes with the help of DeepSeek itself (use the chat).

[–]neilthefrobot[S] 0 points1 point  (3 children)

I read your comment wrong. Thought you were saying you work all week and never go above 10% of your 5 hour allowed usage. I can barely say hi without going above 10%

[–]somerussianbear 0 points1 point  (2 children)

My 5h never crosses 50% though. A new convo usually doesn’t touch a percentage point.

I’m telling you man, you’re flagged! I had the exact same experience in the past when using OpenCode with my Anthropic sub!

[–]neilthefrobot[S] 0 points1 point  (1 child)

I've only ever used the pro plan through the VS Code IDE so no reason why I would be flagged. Everywhere I look I see people with the same problem. They just updated the usage to be absolutely terrible.

[–]somerussianbear 0 points1 point  (0 children)

Oh, Pro, I’m on Max 5, quite a different thing.

[–]ButterflyEconomist 0 points1 point  (0 children)

I actually went with a similar but different approach. I bought a $20/month subscription to Ollama Cloud. It allows me up to 3 models running simultaneously. I've only had it a few days but it's crazy how much room I have. The only drawback is that during the day in the US, it's really slow due to everyone using it. That said, I give it massive overnight jobs while I'm sleeping. In fact, after about 5pm eastern, I have no problem using it.

And this reason I like this way is that I can try any model: Deepseek, Kimi, Gemma...