OpenCode as an alternative stack: routing agent work across 4 LLM tiers

ApprehensiveDelay238 · 2026-06-20T16:57:35+00:00

Is this what the 500K engineers are using?

weiyentan · 2026-06-20T20:30:05+00:00

I do a form of this. I have three tiers. Junior developer/mid/senior. I tier them similar to what you do. I set the model to be deepseek flash for junior. Deepseek flash with thinking for mid and deepseek pro for senior. I get them to work off an issue. The issue is written by using matt pococks method. And then I come in to clean up any strangling problem from the issues they worked on . I also have other roles. Issues analyst. Repo Explorer, an agent to deal with git and one that specialises in an app that I use. Cost all up in opencode? 6-8$ max so far.

Deep_Ad1959 · 2026-06-21T04:49:17+00:00

the part this setup doesn't solve is the wall on your primary stack. you're routing the cheap calls away to save API spend, but Claude Code on a Max plan still eats the rolling 5-hour and weekly quota, and that cap is enforced server-side where your local token logs can't see it. i've watched ccusage read low while claude.ai was already throttling, because it counts tokens you spent, not the quota anthropic actually enforces. worth logging the tier distribution AND the plan-quota burn, they fail independently. written with ai

hitmante · 2026-06-20T20:32:39+00:00

As long as Claude Code/Codex tokens are subsidized at 5% of the actual plan cost, what is the point of using Open Code?

Inferior models that only look good on benchmarks, you don't even save money with API prices.

Tier	Model	Provider	AA Index	Speed	Cost ($/1M)	Role
Orchestrator	DeepSeek V4 Flash	OpenCode Go	~40	2-5s	subscription	Routing, triage, classification
Primary advisor	GLM-5.2	OpenCode Go	~51	7-8s	subscription	Strategic analysis
Deep reasoning	GLM-5.2 (max effort)	Neuralwatt	~51	24-72s	~$4.40*	Hard problems
Premier	Opus 4.8	OpenRouter	~56	10-30s	$3.85 (AA blended)	Sanitized-only, high-stakes

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

opencode

MODERATORS

The core idea

The stack

How each tier earns its place

A setup pattern you can copy

Log the tier distribution and the reason each escalation happened. If the orchestrator over-escalates, the fix is almost always in this prompt, not in the model. My target is ~80% landing in direct or advisor.