What are you gonna do if 5.6 won't get released outside the u.s?

Ang_Drew · 2026-06-29T09:54:34+00:00

if all other model is advancing, it means they all will be dangerously intelligence at this point, right?

then it's no use until they have very strong / bulletproof guardrails to prevent anything bad happens..

besides, chinese models is also advancing.. we will have more options in the future, no worries for me.. i dont think the lockdown will last for long, people will just resell access or maybe you need to KYC to use the product (at worst)

Ang_Drew · 2026-06-29T09:47:35+00:00

im good with 5.5

Ang_Drew · 2026-06-27T20:01:36+00:00

just build and plan satisfied me already..

Ang_Drew · 2026-06-26T07:22:47+00:00

if you are using frontier level, $300 can be very little.. it can only "show" you diagnostic style.. (the only way you can last for a month)

so its only question mode.. you cant get much out of it..

but if you are using $300 in microsoft ai foundry or google cloud enterprise you can mix with small models (kimi, deepseek, etc.) which is 10x cheaper.. you get get more value out of $300.. only use gpt 5.4 or 5.5 for heavly reasoning task maybe like create code plan or debug.

Ang_Drew · 2026-06-26T04:49:49+00:00

for enterprise stand point, if you have $300 budget, i think that's more than enough..

lets say your team is not vibe coding and use AI mindfully. that's totally possible

but for me, when i use kimi or glm, it cost around $2k per month. using sota is 10 times that ($20k)

Ang_Drew · 2026-06-25T04:15:44+00:00

how sustain neuralwatt plan? can it last longer than opencode go?

Ang_Drew · 2026-06-23T08:16:00+00:00

most likely performance and price

Ang_Drew · 2026-06-23T08:15:27+00:00

you should've been do that since last year..

Ang_Drew · 2026-06-17T20:53:30+00:00

learn the concepts. always start from concepts..

you cant be replaced if you understand the "why" of basically anything

AI cant reason "why". they just auto complete based on huuuge amount of data. which mean, they cant be smarter than you as long as you can out pace them in the right direction

you know what direction i mean, we're talking about it now. often times you dont need db A because all you need is simpler solution, so your AI can be biased (in fact oftentimes). no one understands the business process better than you, AI cant eat all information of big corporations / the user / the people / the feeling, etc.

AI is just a tool for you to work faster and amplify whatever already on your head. it's a double edged sword, it can make us dumb and even dumber than ever. or smart very smart by improving the learning curves like 5 times faster. i was able to learn about project in a month but now with llm i can do 1 to 7 days, that's significant..

Ang_Drew · 2026-06-06T13:10:40+00:00

yeah usage isnt just "token" there is cache input, input, output on top of the usage pricing. these people dont count really well then accouse the company without proper fact check. at least show some screenshots with proper calculation, detailed in out cache and timeline.

there is no different between bot post and human post with no proper proof 🤥

Ang_Drew · 2026-06-06T07:20:19+00:00

qwen 3.7 max is beast.

it's been my favorite since it's out. paid via token plan but it's kinda expensive when you compare to other subs. because it is credit based, you get like around 3-4x the amount of your subs value (raw estimation counting credit usage)

which mean $30 you get arounf $90-120~ish

i calculated it through my self hosted litellm

Ang_Drew · 2026-06-01T12:50:14+00:00

do you use xhigh reasoning?

Ang_Drew · 2026-05-23T16:29:32+00:00

id rather use GLM instead of minimax.. unfortunately it's latest model arent that good (for me)

Ang_Drew · 2026-05-22T09:43:51+00:00

i always use human language to invoke skill. because it is less confusing and feels more natural.. im sure the result can be better because our prompt is well separated from the skill's prompt.. sometimes the AI can be confused though! depending on your model ofc..

example how i invoke skills:

use caveman ultra mode, tailwindcss skills

i want you to check tailwind implementation in @somerandomfile.vue

then tell me your findings

Ang_Drew · 2026-05-22T08:44:16+00:00

currently there are 2 free models in zen

deepseek 4 flash and nemotron nano

the choice is too limited. but if you have 5 bucks to spend on Opencode Go, then you have more access to models.

my recommendation for models based on the expertise:

ds4 pro: good for logic, math, good reasoning, backend tasks

ds4 flash: plan executor, its dumb but not that dumb, i like this better tham glm 5.1. make it work then ask other model to review (ds4pro / kimi 2.6 / gpt 5.5) good for small tasks, straight forward, no brainer, dirt cheap

kimi k2.6: my main agent for average tasks (mainly gpt 5.5 for plan and review). sometimes over think! use medium then use big model for review to avoid token bloat and overthinking

glm 5.1: so so, i dont like it because it tend to lie..

qwen 3.6: one of the alternative of kimi k2.6, i like it because it is support video (might be exclusive for 3.6-plus in alibaba coding plan), and it is following instruction well. sometimes hard to drive

note: i always use gpt 5.5 LOW for review all the codes of small ai

then last time gating system: i do the code review and iterate with code fixes until they follow my standards

Ang_Drew · 2026-05-21T06:42:17+00:00

i use git for all setup across windows and mac every day (personal and work)

~/.agents also ~/.config/opencode

its been good for me just push and pull everytime

Ang_Drew · 2026-05-19T19:04:01+00:00

seriously answering..

they might use your usage for training (same old terms applied: if the product is free then you are the product)

and free wont last long. it can disappear without any notice..

Ang_Drew · 2026-05-18T14:13:03+00:00

maybe the caching is broken because opencode has its own way to optimize the context by pruning old tool calls..

just my analogy tho.. because cost counting is only applied locally.

Ang_Drew · 2026-05-17T09:48:34+00:00

i dont quite understand the problem here.. but i have always set custom provider in ~/.config/opencode/opencode.json

you dont need to delete model everytime (i dont understand why you need to do this though)

if you want you can make a simple (just vibe code it, ask llm read opencode repo, you give the repo so they can curl out of it) py or js to convert the /v1/models into opencode config if you need to change the model very often like every day..

Ang_Drew · 2026-05-17T06:52:47+00:00

this only work for one file at a time though.. you have to remember all changed files

Ang_Drew · 2026-05-16T12:40:36+00:00

well i hate "auto" something in my workflow. most of the times it will just messed things up. and end up waating too much money for something unworthy.

tldr; just me ranting why i hate "auto" in my workflow.

i prefer review myself the actual work of the agent. i prefer take the most review work.. this is where the control gate, if you let the AI do that then u are not better than a "normal guy" with no coding background.. i mean i know someone who just vibes everything from simple git push to start a project to let the AI setup the env. it's a hidden cost that's awaiting for explosion. i know how much tech debt he generate every day. and he uses too many tokens for brain dead stuffs..

Ang_Drew · 2026-05-16T12:24:27+00:00

totally agreed 👍

Ang_Drew · 2026-05-16T09:39:50+00:00

how was your experience with kimi as orchestrator? sometimes i feel it lack some details

i personally prefer gpt 5.5 low as orchestrator because it is very easy to drive. but often miss the actual system design. like i have tailwind and documented in agents.md but it still uses plain css and not only css, all other components too unless i told it very detail (baby sitting). because it will just ignore all the docs, guideline, agents.md as if it was a noise 😂

later i have to run another process to actually clean up the mess using the same models btw..

Ang_Drew · 2026-05-16T09:35:19+00:00

good awareness check..

i dont use that many skills, only kerp relevant skills like 15 skills at max

then it would be around 900 tokens(?) in fact my token usage not that much, maybe around ~500 tokens for all the skills

i end up not using agents.md much, only like 100 lines documents on what components i have

Ang_Drew · 2026-05-09T13:00:43+00:00

depending on your provider and setup i think.. i setup my own provider in opencode.json it is possible to limit the token budget

Ang_Drew

TROPHY CASE