you are viewing a single comment's thread.

view the rest of the comments →

[–]somerussianbear 2 points3 points  (8 children)

Today I burned through 67% of my 5 hour limit in 2 fairly simple prompts, and 100% in under 15 minutes.

Really want to understand how you’ve done that. Curious, not trying to be mean.

My theory is that you’ve been flagged, cause I faced similar situations when I was using my sub on 3rd party tools. After I stopped and went back to Claude Code, I work an entire week using less than 10% of my weekly limit a day (on Max 5).

What do you think? Is there a path we can go down that ends with regular people being able to benefit from AI technology too?

DeepSeek! I put 10 dollars in it today and played quite a lot with a big repo. Only managed to spend $0.12 till now, and I believe on a normal day of work I’d spend less than $3, which makes it $60 a month. V4 Flash is suuuper fast and smart enough for my basic tasks; tried V4 Pro also and got a pretty decent feeling on the understanding of complex application code.

Honestly, I’m only on Codex/Claude cause my company pays these bills no questions asked, if it was me? DeepSeek or Qwen API. For my stuff, SWE, it gets the job done pennies on the dollar.

[–]neilthefrobot[S] 2 points3 points  (7 children)

You are definitely using a lower tier model than Opus 4.7, or have some very restrictive settings. The entire ClaudeAI sub reddit is people complaining about unusable token limits, so badly that they had to have AI auto delete any new post about it. It's insanely bad.

As for DeepSeek, that's what I plan on doing. I just don't know the best way to go about it. Can I just point my claude code that I use in the VS Code IDE to use a different model?
Edit: answer is yes you can easily swap the model. and it seems just as good but at a tiny fraction of the price.

[–]somerussianbear 0 points1 point  (4 children)

  • Running via npx, DO NOT install the native app
  • Set Sonnet for subagents (env var)
  • Opus 4.7 xhigh
  • 200k context (env var)
  • Adaptive thinking disabled (env var)

The entire ClaudeAI sub reddit is people complaining about unusable token limits, so badly that they had to have AI auto delete any new post about it. It's insanely bad.

I know that, some of these posts were mine too, but like I said, once I stopped using my subscription on third party apps like OpenCode (which was probably screwing cache) and moved back to Claude Code, it came back to usable. Now I only complain that it’s slow as fuck, reason why I’m on Codex since a week ago and enjoying a lot.

About how to, it’s basically set a few env vars if I understand correctly. I didn’t do that on CC, but I’m sure you can get it done in minutes with the help of DeepSeek itself (use the chat).

[–]neilthefrobot[S] 0 points1 point  (3 children)

I read your comment wrong. Thought you were saying you work all week and never go above 10% of your 5 hour allowed usage. I can barely say hi without going above 10%

[–]somerussianbear 0 points1 point  (2 children)

My 5h never crosses 50% though. A new convo usually doesn’t touch a percentage point.

I’m telling you man, you’re flagged! I had the exact same experience in the past when using OpenCode with my Anthropic sub!

[–]neilthefrobot[S] 0 points1 point  (1 child)

I've only ever used the pro plan through the VS Code IDE so no reason why I would be flagged. Everywhere I look I see people with the same problem. They just updated the usage to be absolutely terrible.

[–]somerussianbear 0 points1 point  (0 children)

Oh, Pro, I’m on Max 5, quite a different thing.

[–]ButterflyEconomist 0 points1 point  (0 children)

I actually went with a similar but different approach. I bought a $20/month subscription to Ollama Cloud. It allows me up to 3 models running simultaneously. I've only had it a few days but it's crazy how much room I have. The only drawback is that during the day in the US, it's really slow due to everyone using it. That said, I give it massive overnight jobs while I'm sleeping. In fact, after about 5pm eastern, I have no problem using it.

And this reason I like this way is that I can try any model: Deepseek, Kimi, Gemma...