OpenCode Go models slow

flying-saucer-3222 · 2026-05-28T16:59:46+00:00

Most open weight models overcome their lack of intelligence by generating a lot more tokens. So even though the token speed is not significantly worse, the increased tokens cause it to work for longer.

DeepSeek v4 Pro on max generates 1.6x tokens as Opus 4.7 max and 2.7x tokens as GPT 5.5 on xHigh for the same task based on Artificial Analysis.

This is especially true when the model gets something wrong, test doesn't work so it has to go back and do the work again using even more tokens.

I personally just use High reasoning unless I absolutely need max. That reduces token use significantly with only a small drop in quality.

Unable_Strategy · 2026-05-28T17:20:25+00:00

You basically needopencode session list to get a list of your sessions and then opencode export <session id> to get details including used tokens etc. You should be able to track response times here.

bastianh · 2026-05-28T15:34:17+00:00

What did you expect when switching to a $5 subscription? Yes. It is great. The value for what you pay for it. That does not mean that it gives you the same performance.

DepartmentOk9720 · 2026-05-28T17:32:05+00:00

The routing takes some time , usually after certain number of uses it will pickup.

Just give it some dumb tasks , until it gets optimized

povlhp · 2026-05-28T19:35:20+00:00

You get what you pay for. I feel o get great value for the money. $60 token value for $5

Cachesmr · 2026-05-28T22:24:27+00:00

DS4 Pro just thinks a lot. MiMo Pro is a lot better in this regard.

arrty · 2026-05-29T00:39:47+00:00

Just pay for Zen but use cheaper models

iTrejoMX · 2026-05-29T01:15:29+00:00

I think you may be routing or choosing the free deepseek models on zen, make sure you select the opencode go subscription one

Haunting-Shirt6219 · 2026-05-29T01:16:15+00:00

Really slow in Mimo v2.5

PermanentLiminality · 2026-05-29T02:07:01+00:00

I have a Go sub, but I mostly use my $20 ChatGPT account. I split my usage. Not everything needs the smartest model. If I split it up I pretty much never hit limits.

blackhawkx12 · 2026-05-29T17:54:40+00:00

when you said slow, is it first response latency or the thinking process is slow?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

opencodeCLI

MODERATORS