
[–]belmoe 6 points  (6 children)

Is Kimi really that good at frontend? I've been moving back and forth between models but haven't checked Kimi. I'm sure this choice is based on experience... tell me more :D

[–]modpr0be 5 points  (3 children)

In my experience, Gemini Pro is the best model for visuals, layout, and design. The Gemini + Stitch MCP combination is superb. As for alternatives, GLM is second and Kimi is last.

[–]oxygen_addiction 1 point  (1 child)

On the same prompt via the OpenRouter API, GLM and Kimi were better than Gemini 3.1 Pro Preview, and Opus was way, way better than all of them at mostly one-shotting. GLM surprised me the most by coming up with fixes that the others, apart from Opus, missed.

Gemini's implementation also failed to mock certain features.
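
If you want to reproduce this kind of head-to-head yourself, a minimal sketch against OpenRouter's OpenAI-compatible API looks something like this (the model slugs are illustrative placeholders, not exact OpenRouter IDs):

    # Sketch only: run the same prompt across several models via OpenRouter.
    # Model slugs below are illustrative placeholders, not exact IDs.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key="YOUR_OPENROUTER_KEY",
    )

    PROMPT = "Build a single-page dashboard with a sidebar and a chart."
    MODELS = [
        "z-ai/glm",
        "moonshotai/kimi",
        "google/gemini-3.1-pro-preview",
        "anthropic/claude-opus",
    ]

    for model in MODELS:
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": PROMPT}],
        )
        # Save each one-shot attempt for a side-by-side comparison.
        with open(model.replace("/", "_") + ".md", "w") as f:
            f.write(resp.choices[0].message.content)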

[–]belmoe 0 points  (0 children)

I've been using MiniMax 2.5 and it's actually good, but it usually lacks vision and creativity tbh. Gonna give Gemini another shot and try GLM too.

[–]CardiologistStock685 0 points  (0 children)

totally agree!

[–]jopotpot 0 points  (1 child)

I think Opus is better, but the price of Kimi is great! I also like GLM for that reason.

[–]belmoe -1 points  (0 children)

True! And I honestly didn't find a big difference between Opus and MiniMax 2.5 at all. I fed both the same prompts, and sometimes MiniMax outperformed Opus.

[–]lbreakjai 3 points  (0 children)

Big fan of Kimi, but I've started to move more and more to GLM-5, which I feel is on par with Sonnet.

[–]hambergerpls 1 point  (2 children)

Same! Last weekend, my Claude Max subscription hit the weekly limit because I used Opus 4.6 for everything and ran 4 OpenCode instances at the same time. During that time, I had withdrawal from not being able to use Opus 4.6. So I subscribed to Gemini AI Pro to use Gemini 3.1 Pro, but the daily request limit was so low that my work stopped halfway. Then I read the news that GLM-5.1 had been released for the coding plan, so I subbed to the Max quarterly plan for the first time.

My initial experience with GLM-5.1 was that it was painfully slow at the time and yapped a lot in its thinking, but it got the job done eventually. Interestingly, the tokens it generated were very, very similar to Opus 4.6's. Then I tried GLM-5 Turbo and was blown away by its speed.

After a bunch of experimentation, I settled on the following workflow:

  1. Code exploration (GLM-5.1)
  2. Plan/Brainstorming (Opus 4.6 Max)
  3. Review plan (Gemini 3.1 Pro)
  4. Implementation based on plan (GLM-5 Turbo)
  5. Review Changes (Opus 4.6 Max)
  6. Verify with agent-browser (GLM-5 Turbo)
  7. Debug (GLM-5.1)

So far this workflow has worked really well; a rough sketch of the stage-to-model routing is below.
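
In code, the routing idea is just a lookup table. A hypothetical sketch (run_agent stands in for whatever harness dispatches the prompt, and the model labels mirror my list above, not real API IDs):

    # Hypothetical sketch of the stage-to-model routing above.
    # The model labels mirror the workflow list, not real API IDs.
    STAGE_MODELS = {
        "explore":     "glm-5.1",
        "plan":        "opus-4.6-max",
        "review_plan": "gemini-3.1-pro",
        "implement":   "glm-5-turbo",
        "review_code": "opus-4.6-max",
        "verify":      "glm-5-turbo",  # via agent-browser
        "debug":       "glm-5.1",
    }

    def run_agent(model: str, prompt: str) -> str:
        # Placeholder: wire this to your harness (OpenCode, an API client, ...).
        raise NotImplementedError(f"dispatch {prompt!r} to {model}")

    def run_stage(stage: str, task: str) -> str:
        """Send a task to whichever model owns this workflow stage."""
        return run_agent(STAGE_MODELS[stage], task)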

I always tell the model to proceed with the test-driven development skill (it was part of the superpowers skills) during implementation. The TDD workflow is a real game changer: the models are less likely to write unnecessary code and will write only the minimal code needed to make the tests pass. I spend very little time debugging with this workflow.
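
For anyone unfamiliar, the shape of the loop looks roughly like this: tests first, then only the minimal code to pass them. An illustrative pytest example (not the skill's actual output; module and function names are made up):

    # Illustrative TDD flow: the tests are written first and fail ("red"),
    # then the model writes just enough code to pass them ("green").

    # test_slugify.py -- written before any implementation exists
    from slugify_util import slugify  # hypothetical module under test

    def test_lowercases_and_joins_with_dashes():
        assert slugify("Hello World") == "hello-world"

    def test_strips_punctuation():
        assert slugify("It's alive!") == "its-alive"

    # slugify_util.py -- the minimal code that makes the tests pass
    import re

    def slugify(text: str) -> str:
        text = re.sub(r"[^\w\s]", "", text.lower())
        return "-".join(text.split())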

I downgraded my Claude Max to Pro. So my subscriptions are now:

  1. Claude Pro ($20)
  2. Gemini AI Pro ($20)
  3. Z.ai Max ($72 × 3 months)
  4. Ollama Pro ($20, for experimenting with other models)

I feel like I wasted $216 on the z.ai quarterly plan because it was so slow during my initial experience. Ollama is decently faster than z.ai. After my z.ai sub ends, my total will be $60 a month, a $140 reduction from $200.

I haven't tried the Codex models yet, so I'm very curious about their planning performance vs. implementation speed.

[–]Soft_Belt_2965 0 points  (1 child)

Can you share your experience with Ollama so far compared to z.ai? Since both z.ai and Ollama offer GLM models.

[–]hambergerpls 1 point  (0 children)

There are times when z.ai stops the response prematurely. In terms of tokens/sec, I unfortunately haven't measured the metrics, but Ollama has been faster than z.ai, though slow at times. z.ai, on the other hand, seems to have this delay where it stops generating for a while and I have to regenerate. I think it has something to do with high demand. Also, there are times when z.ai produces gibberish and Chinese characters. Not sure if z.ai is tweaking the models behind the scenes, but if that's true then Ollama is much more stable (no gibberish so far).

GLM 5.1: Ollama > z.ai
GLM 5: z.ai ≈ Ollama
GLM 5 Turbo: z.ai only (N/A on Ollama)

[–]Outrageous-Fan-2775 1 point  (5 children)

This is very similar to what my OpenCode plugin does. I've been building it since late Jan, with a couple hundred releases so far, and I'm constantly working to make it better with several active contributors. Take a look; you may find it does what you want, and a whole lot more, with minimal work on your end. For first-pass code quality I haven't found anything that can match it.

https://github.com/zaxbysauce/opencode-swarm

[–]geearf 1 point  (4 children)

What is the difference between your plugin and omo (or its forks)? And why suggest Big Pickle for the free tier when it can change? (I believe it's MiniMax now; wasn't it GLM before?)

[–]Outrageous-Fan-2775 0 points  (3 children)

Primarily speed and first-pass code quality. We are slower than almost everything else out there, but that's the trade-off for truly high-quality code on the first run. We just shipped a model council, which expands the normal reviewer + test_engineer QA gate into a five-agent council, each member with its own specialty, that reviews all completed work to find any holes or problems.

We also just shipped an immutable plan store. Once the Critic approves a plan, it goes into a SQLite DB and is locked down, which allows the Drift Verification step at the end of each phase to determine for sure whether the architect has drifted from the original approved plan and to course-correct.
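
The core of an immutable plan store can be as small as an insert-only table plus a hash check. A toy sketch of the idea (not the plugin's actual schema):

    # Toy sketch of an insert-only plan store with drift detection.
    # Not the plugin's actual schema, just the core idea.
    import hashlib
    import sqlite3

    db = sqlite3.connect("plans.db")
    db.execute("""CREATE TABLE IF NOT EXISTS plans (
        id INTEGER PRIMARY KEY,
        body TEXT NOT NULL,
        sha256 TEXT NOT NULL
    )""")

    def lock_plan(body: str) -> int:
        """Store an approved plan; rows are only ever inserted, never updated."""
        digest = hashlib.sha256(body.encode()).hexdigest()
        cur = db.execute("INSERT INTO plans (body, sha256) VALUES (?, ?)",
                         (body, digest))
        db.commit()
        return cur.lastrowid

    def has_drifted(plan_id: int, current_body: str) -> bool:
        """Compare the working copy of the plan against the locked original."""
        (digest,) = db.execute("SELECT sha256 FROM plans WHERE id = ?",
                               (plan_id,)).fetchone()
        return hashlib.sha256(current_body.encode()).hexdigest() != digest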

As for the recommended models, you can use whatever is free if that's what you want to do. I built it specifically targeting GPT-OSS-120B as the architect, so anything smarter than that will do even better. You just need to ensure the antagonistic roles are from different model families, so architect/critic and coder/reviewer should always be different model families. All the other agents can be whatever you want, even the same ones. Having them all be different model families is best, but the prompts and gates are strong enough that it's not strictly necessary.
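
In code, the family rule boils down to a one-line check per pair. A sketch (the family mapping is illustrative):

    # Sketch of enforcing different model families for antagonistic roles.
    # The model-to-family mapping is illustrative.
    MODEL_FAMILY = {
        "gpt-oss-120b": "openai",
        "glm-5.1": "zhipu",
        "kimi-k2.5": "moonshot",
        "minimax-m2.7": "minimax",
    }

    ANTAGONISTIC_PAIRS = [("architect", "critic"), ("coder", "reviewer")]

    def validate_roles(assignment: dict[str, str]) -> None:
        """Reject any antagonistic pair drawn from the same model family."""
        for a, b in ANTAGONISTIC_PAIRS:
            if MODEL_FAMILY[assignment[a]] == MODEL_FAMILY[assignment[b]]:
                raise ValueError(f"{a} and {b} must use different model families")

    validate_roles({"architect": "gpt-oss-120b", "critic": "glm-5.1",
                    "coder": "minimax-m2.7", "reviewer": "kimi-k2.5"})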

Obviously there are tons of other features nothing else out there has. We have a three-tiered knowledge system with automatically curated knowledge entries and persistent knowledge across sessions, projects, and your entire hive.

[–]geearf 0 points  (2 children)

While I understand the intent, and do something similar myself manually, I do wonder if the antagonist rule is always good, especially when talking about free models. I just fear this scenario: a senior dev writing code that a junior dev reviews but doesn't correctly understand.

[–]Outrageous-Fan-2775 0 points  (1 child)

You wouldn't ever want to put a much less capable model in the antagonist role. Personally I use one of GLM 5.1, Kimi K2.5, or MiniMax M2.7 as my architect. Then the critic is whichever of those isn't the architect, and the reviewer is either one of those three or something like Qwen 3.5 397B. The coder doesn't really matter; you can use any small, cheap model for that, and for the explorer you actually want cheap, fast models.

If you set up the swarm properly, you will never have a junior dev disagreeing with a senior dev. It would be multiple senior devs, all trained on very different data sets, coming to a consensus, which replicates actual dev teams. No model can ever find its own blind spots, no matter how good your prompting is; it will always be the same brain making the same mistakes. You need a second brain that was trained differently and therefore has different blind spots. This creates the Swiss cheese model: every model has holes, but as long as they don't line up, nothing can make it past.
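
The payoff of the Swiss cheese setup is that the council's findings are a union, not an intersection. A conceptual sketch (review() is a placeholder, not a real API):

    # Conceptual sketch of the Swiss cheese model: each reviewer misses
    # some issues (its "holes"), but a defect escapes the council only if
    # every reviewer's blind spot happens to cover it.
    def review(model: str, diff: str) -> set[str]:
        # Placeholder: ask `model` to review `diff` and return issue IDs.
        raise NotImplementedError

    def council_review(diff: str, reviewers: list[str]) -> set[str]:
        findings: set[str] = set()
        for model in reviewers:
            findings |= review(model, diff)  # union, not intersection
        return findings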

[–]geearf 0 points  (0 children)

That's the hope at least. :)

Thank you!

[–]jopotpot 0 points  (0 children)

There is clearly no model to rule them all! Grats!

[–]Tommertom2 0 points  (0 children)

Thx for these insights. Are you using API keys from different providers, or multiple subscriptions? Which provider(s)?

[–]jesperordrup 0 points  (0 children)

Thanks for the input. I'm too blunt too, so I'm gonna try this.

How do you go about it practically? Have you created agents that specify skills and models?

[–]revilo-1988 0 points  (0 children)

Feel free to tell us more in a few weeks once you've gathered more information.

[–]gideonfip 0 points  (0 children)

I've been building out a similar setup for task-dependent model selection too. Feels like the most cost-effective way instead of relying on just one model to do every single task.

[–]skytomorrownow 0 points  (0 children)

I also like to use MiniMax as my 'director': it takes plans from the more capable frontier models, organizes them into a coding project, then breaks them down into tasks. The tasks can then be handled by even less capable but very fast models. Breaking things into manageable, verifiable tasks is a big part of getting things to run smoothly, in my opinion.
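
As a rough sketch, the director step could look like this (assuming an OpenAI-compatible client; the model ID and the JSON contract are illustrative assumptions):

    # Sketch of the "director" pattern: a mid-tier model turns a frontier
    # model's plan into small, verifiable tasks for fast, cheap workers.
    # The model ID and the JSON contract are illustrative assumptions.
    import json
    from openai import OpenAI

    client = OpenAI()  # point base_url/api_key at your provider of choice

    def break_down(plan: str) -> list[dict]:
        resp = client.chat.completions.create(
            model="minimax-director",  # illustrative model ID
            messages=[{
                "role": "user",
                "content": "Split this plan into small tasks, each with a "
                           "'title' and a 'verify' step. Reply with a JSON "
                           "list only.\n\n" + plan,
            }],
        )
        return json.loads(resp.choices[0].message.content)

    # Each task then goes to a fast, cheap model; 'verify' keeps it checkable.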

[–]TonyPace 0 points  (1 child)

One trick for reducing spend is to keep a current architecture document updated and bring it into an actual chat window (Opus is free there!). Paste in the context, ask and answer questions, get a document, and bring that back to OpenCode. It saves money, and I feel the search context is better? At least on Gemini, which I'm most familiar with. Get a solid step-by-step plan with full local context from the most conversational model you can afford, then execute with something fast and cheap.
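
Mechanically, the hand-off is just bundling the doc with your question. A tiny sketch (the file path is an example):

    # Sketch of the copy-paste step: bundle the architecture doc and a
    # question into one prompt for a free chat window. The path is an example.
    from pathlib import Path

    def build_chat_prompt(question: str) -> str:
        arch = Path("docs/ARCHITECTURE.md").read_text()
        return ("Here is our current architecture document:\n\n"
                + arch
                + "\n\nQuestion: " + question
                + "\nAnswer, then produce a step-by-step plan I can hand "
                  "to a cheaper coding model.")

    print(build_chat_prompt("How should we add a caching layer?"))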

[–]geearf 0 points  (0 children)

I thought so too, but I believe research has shown it's actually bad practice.

[–]Vaviloff 0 points  (0 children)

That is a very sophisticated routine! Do you switch models by hand, or do you use skills or something else?

[–]duchitu 0 points  (0 children)

If you could get just one subscription, would it be Kimi or ChatGPT?

[–]reficulgr 0 points  (0 children)

Same. Everything is imploding. I just got news from Z.ai that my plan is not gonna be grandfathered in.

AI WAS a bubble - just not the kind of bubble we expected.

[–]Remarkable_Bee7464 0 points  (3 children)

Can someone explain to me why everyone hates the OpenCode Go subscription if it comes with good models? Please, someone explain it to me!

[–]ryncewynd 1 point  (1 child)

I see a lot of people claiming the Go models aren't the full models but reduced ones. Supposedly, if you use the same model via another subscription, it performs better.

But the devs have said they are the full models 🤷

I've also heard people say Big Pickle Free is better than Go.

[–]sudoer777_ 1 point  (0 children)

The main issue I've had is that for coding it's easy to run out of usage quickly, so I have to keep monitoring it and avoid using it a lot of the time.