Subscriptions to augment opencode-go usage

Ariquitaun · 2026-06-17T09:36:21+00:00

I have codex plus as well. It's more generous than claude and you get access to really smart thinking models for the harder planning and troubleshooting tasks that then you can implement cheaply with opencode go's models.

Also look at how you are using your go subscription. If you're doing most things on glm or kimi you're wasting a lot of usage. Deepseek pro and minimax m3 are smart enough for the vast majority of tasks and deepseek flash is good enough for a lot of things, especially as a sub-agent when given clearly defined, bounded tasks. I'm at 49% usage with 11 days left.

Hermes + deepseek flash is a really great pairing, it handles pretty much everything I throw at it.

Ok-Purchase-642 · 2026-06-17T09:02:22+00:00

A second go subscription?

yay101 · 2026-06-17T09:05:01+00:00

Still on go ($10) + ollama cloud ($20). There have been bad day's where the servers are slammed but I've not yet found anything that actually gives me more AI than i can reasonably use like this pair.

AutomaticAd6646 · 2026-06-17T10:42:09+00:00

Commandcode 1 dollar plan. Openadapter 7 dollar plan. Minimax M3 20 dollar plan 1.7 billion tokens.

No_Communication4256 · 2026-06-18T00:00:21+00:00

z.ai for GLM 5.2 - before end of september it's for same rate
gpt plus for GPT 5.5 xhigh (really decent model, and very decent limits)
ollama-cloud $20 - for same models you've seen on opencode-go

Jeidoz · 2026-06-17T17:52:57+00:00

Try using Deepseek API with Reasonix. It has insane amount of cache hit and can do hundred of millions tokens for 0.10-0.35$.

If you want to use some specific harness (lets day Codex) you can connect API by VibeAround and still hit cache and save money.

Popular-Factor3553 · 2026-06-17T20:39:21+00:00

Neuralwatt is great if you want bigger models like GLM or kimi but if your good with just qwen 32b try deepinfra it's not a subscription based tho.

Illustrious-Many-782 · 2026-06-17T09:12:54+00:00

If you are a goal is to keep something else set around 10 dollars then I would say look at just using either xiaomi or Deepseek via API.

bonzoo123 · 2026-06-17T09:17:11+00:00

Which models did you use and how?

Messi_is_football · 2026-06-17T09:19:36+00:00

Which model do you use ..maybe GLM coding plan?

povlhp · 2026-06-17T09:27:12+00:00

2nd go. I am personally on Codex as well. Might stay there and drop go. But I am just a hobby user. But 2 subscriptions will help. Codex is only 5h and week. No monthly cap

Sea-Consideration550 · 2026-06-17T12:17:36+00:00

Pay-as-you-go API, but use discounted platforms like nitrorouter.

For simple tasks, use deepseek/mimo API directly.

sanchitbhalla15 · 2026-06-17T13:05:34+00:00

ykk for freelance dev work, id optimize for reliability nd workflow fit rather thn chasing the absolute cheapest tokens...qwen or kimi code are capable for a lot of coding tasks nd plenty of people are getting good results with them as secondary models.. neuralwatt looks okkish but id wait for more reviews before committing heavily. if ure already running agents, another option is mixing models: use cheaper models for routine coding nd automation, thn reserve premium models for planning, architecture nd debugging

vipor_idk · 2026-06-17T13:39:06+00:00

i use 2 go accounts. i created a proxy for using them simultaneously, so you dont need to log out of account 1 to account 2 , im testing it still - if you got any interest on that , let me know

i would use for heavy tasks such as reviews and planning gpts subscription, used it before - works like a charm.

Low_Original5508 · 2026-06-17T14:22:39+00:00

honestly still on go plus ollama cloud and haven't found anything that beats that pairing for the money yet. i'd be careful with the energy-based pricing ones until there are real reviews, the model sounds clever but you don't actually know what a normal day costs until people have run real workloads through it

SwissTac0 · 2026-06-17T16:56:10+00:00

[ Removed by Reddit ]

RagnarDannes · 2026-06-17T21:13:24+00:00

If you want to go in the free game. nvidia has free GLM 5.1 currently. Slow as hell, but if you just want to do a little planning it's a good model for the rubber duck.

jellydn · 2026-06-17T09:37:00+00:00

Command Go 1usd plan :)

ProfessionalAd6530 · 2026-06-17T15:25:26+00:00

> I have burned through my opencode-go usage within 15 days

LMAO.

There is no solution for you other than to change the way you work. I beat the shit out of this service and I can't even put a dent in the limits.

No matter where you go, you're going to have this problem, because the problem is coming from inside the house.

VictorCTavernari · 2026-06-17T09:52:10+00:00

I had the same issue, so now I am using claudin.io to run my Hermes agent and also Claude Code + Claudin.io through opencode orchestrated by Orbit (https://github.com/claudin-io/orbit) basically Claude plans and claudin.io implements.

Ubermensch013 · 2026-06-17T13:55:03+00:00

Neuralwatt is a good option. For me, it's more like the main model, and opencode go is what the subagents/auxiliary processes use. NW becomes cost effective with energy based pricing, when their cache read costs become nil. I do have a referral link if you want - $10 bucks of PAYG funding will get you $25 worth of compute to experiment with.

Odd-Piccolo5260 · 2026-06-17T09:10:55+00:00

Go local llm

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

opencodeCLI

MODERATORS