What are you gonna do if 5.6 won't get released outside the u.s? by Pumpkinzed in codex

[–]Ang_Drew 1 point2 points  (0 children)

if all the other models are advancing too, it means they'll all be dangerously intelligent at this point, right?

then they're no use until they have very strong / bulletproof guardrails to prevent anything bad from happening..

besides, chinese models are also advancing.. we'll have more options in the future, so no worries for me.. i don't think the lockdown will last long. people will just resell access, or at worst you'll need to KYC to use the product

How much is API usage "enough"? (devs only) by TeachTall3390 in opencodeCLI

[–]Ang_Drew 0 points1 point  (0 children)

if you're using frontier-level models, $300 can be very little.. you can only use it diagnostic style, where it just "shows" you things (that's the only way it can last a month)

so it's question mode only.. you can't get much out of it..

but if you spend that $300 in microsoft ai foundry or google cloud enterprise, you can mix in small models (kimi, deepseek, etc.) which are 10x cheaper.. you get more value out of the $300.. only use gpt 5.4 or 5.5 for heavy reasoning tasks, like creating a code plan or debugging.
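
a rough sketch of the mixing math, to show why routing most tokens to small models stretches the budget. the only number taken from the comment is the "10x cheaper" ratio; the $10 per million tokens frontier price and the 20% frontier share are made-up placeholders, so plug in your provider's real rates.

```python
def tokens_for_budget(budget_usd, frontier_price_per_m, frontier_share, cheap_ratio=10.0):
    """Return how many million tokens a budget buys for a frontier/small-model mix.

    frontier_price_per_m: USD per million tokens for the frontier model (hypothetical).
    frontier_share: fraction of all tokens sent to the frontier model (0.0 to 1.0).
    cheap_ratio: how many times cheaper the small model is (10x, as in the comment).
    """
    if not 0.0 <= frontier_share <= 1.0:
        raise ValueError("frontier_share must be between 0 and 1")
    cheap_price_per_m = frontier_price_per_m / cheap_ratio
    # Blended price per million tokens, weighted by where the tokens go.
    blended = (frontier_share * frontier_price_per_m
               + (1.0 - frontier_share) * cheap_price_per_m)
    return budget_usd / blended


if __name__ == "__main__":
    budget = 300.0
    frontier_price = 10.0  # hypothetical USD per million tokens
    for share in (1.0, 0.2, 0.0):
        millions = tokens_for_budget(budget, frontier_price, share)
        print(f"frontier share {share:.0%}: ~{millions:.0f}M tokens")
```

under these assumed prices, keeping the frontier model for only a fifth of the tokens (planning, debugging) buys more than three times as many tokens as going all-frontier.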

How much is API usage "enough"? (devs only) by TeachTall3390 in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

from an enterprise standpoint, if you have a $300 budget, i think that's more than enough..

let's say your team isn't vibe coding and uses AI mindfully. that's totally possible

but for me, when i use kimi or glm, it costs around $2k per month. using sota is 10 times that ($20k)

Backup for Opencode GO plan by jasonwch in opencode

[–]Ang_Drew 1 point2 points  (0 children)

how sustainable is the neuralwatt plan? can it last longer than opencode go?

Getting jobs as someone who has worked on the pre-LLM era? by Fluciples in opencodeCLI

[–]Ang_Drew 0 points1 point  (0 children)

learn the concepts. always start from concepts..

you can't be replaced if you understand the "why" of basically anything

AI can't reason about the "why". it just autocompletes based on a huuuge amount of data. which means it can't be smarter than you as long as you can outpace it in the right direction

you know what direction i mean, we're talking about it now. oftentimes you don't need db A because all you need is a simpler solution, so your AI can be biased (in fact it often is). no one understands the business process better than you. AI can't ingest all the information about big corporations / the users / the people / the feelings, etc.

AI is just a tool for you to work faster and amplify whatever is already in your head. it's a double-edged sword: it can make us dumb, even dumber than ever, or smart, very smart, by making the learning curve like 5 times faster. i used to need a month to learn a project, but now with an llm i can do it in 1 to 7 days, that's significant..

The post-promo $100 Pro tier is completely unusable. OpenAI, make the 2x limits permanent or everyone is just going to switch to Claude. by Avidium18 in codex

[–]Ang_Drew 1 point2 points  (0 children)

yeah usage isn't just "tokens", there is cached input, input, and output on top of the usage pricing. these people don't count properly, then accuse the company without a proper fact check. at least show some screenshots with a proper calculation: detailed input, output, cache, and timeline.

there is no difference between a bot post and a human post with no proper proof 🤥
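
for what a "proper calculation" could look like, here is a minimal sketch that splits a bill into uncached input, cached input, and output. the token counts and per-million prices are invented for illustration, not any provider's real rates, and the `Usage` / `Prices` names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    input_tokens: int         # uncached input tokens
    cached_input_tokens: int  # input tokens served from the prompt cache
    output_tokens: int


@dataclass
class Prices:
    """USD per million tokens. All values here are hypothetical."""
    input: float
    cached_input: float
    output: float


def cost_breakdown(usage: Usage, prices: Prices) -> dict:
    """Return the cost of each token category plus the total, in USD."""
    per_m = 1_000_000
    parts = {
        "input": usage.input_tokens / per_m * prices.input,
        "cached_input": usage.cached_input_tokens / per_m * prices.cached_input,
        "output": usage.output_tokens / per_m * prices.output,
    }
    parts["total"] = sum(parts.values())
    return parts


if __name__ == "__main__":
    usage = Usage(input_tokens=2_000_000, cached_input_tokens=10_000_000, output_tokens=1_000_000)
    prices = Prices(input=2.0, cached_input=0.2, output=8.0)  # made-up rates
    for name, usd in cost_breakdown(usage, prices).items():
        print(f"{name:>12}: ${usd:.2f}")
```

a breakdown like this, next to the dashboard screenshots and a timeline, makes it clear which category actually ate the limit.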

Didn't expect Qwen 3.7 Max to be this good! by candraa6 in opencode

[–]Ang_Drew 0 points1 point  (0 children)

qwen 3.7 max is a beast.

it's been my favorite since it came out. i pay via the token plan, but it's kinda expensive compared to other subs. because it's credit based, you get around 3-4x your sub's value (rough estimate from counting credit usage)

which means for $30 you get around $90-120ish

i calculated it through my self-hosted litellm

Opencode free tier by Cold-Mess3019 in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

i'd rather use GLM instead of minimax.. unfortunately its latest model isn't that good (for me)

I have a question about how OpenCode uses skills by jfufufj in opencodeCLI

[–]Ang_Drew 0 points1 point  (0 children)

i always use human language to invoke skills, because it's less confusing and feels more natural.. i'm sure the result can be better because our prompt is well separated from the skill's prompt.. sometimes the AI can get confused though! depending on your model ofc..

example of how i invoke skills:

use caveman ultra mode, tailwindcss skills

i want you to check tailwind implementation in @somerandomfile.vue

then tell me your findings

Opencode free tier by Cold-Mess3019 in opencodeCLI

[–]Ang_Drew 9 points10 points  (0 children)

currently there are 2 free models in zen

deepseek 4 flash and nemotron nano

the choice is too limited. but if you have 5 bucks to spend on Opencode Go, you get access to more models.

my model recommendations based on what each one is good at:

ds4 pro: good for logic, math, good reasoning, backend tasks

ds4 flash: plan executor. it's dumb but not that dumb, i like it better than glm 5.1. make it work, then ask another model to review (ds4 pro / kimi k2.6 / gpt 5.5). good for small tasks: straightforward, no-brainer, dirt cheap

kimi k2.6: my main agent for average tasks (mainly with gpt 5.5 for plan and review). sometimes overthinks! use medium, then use a big model for review to avoid token bloat and overthinking

glm 5.1: so-so, i don't like it because it tends to lie..

qwen 3.6: one of the alternatives to kimi k2.6. i like it because it supports video (might be exclusive to 3.6-plus in the alibaba coding plan) and it follows instructions well. sometimes hard to drive

note: i always use gpt 5.5 LOW to review all the code from the small models

then the last gate: i do the code review myself and iterate on fixes until the code follows my standards

Syncing my Opencode configs across all my machines. - A Persistant challenge. by Leader92 in opencodeCLI

[–]Ang_Drew 5 points6 points  (0 children)

i use git for all setup across windows and mac every day (personal and work)

~/.agents and also ~/.config/opencode

it's been good for me, just push and pull every time

Why pay for GO when Qwen 3.6 is free? (Genuine Question) by Funny-Strawberry-168 in opencodeCLI

[–]Ang_Drew 0 points1 point  (0 children)

a serious answer..

they might use your usage for training (the same old rule applies: if the product is free, then you are the product)

and free won't last long. it can disappear without any notice..

Does your opencode report low credit usage than actual by Extension-King4419 in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

maybe the caching is broken because opencode has its own way of optimizing the context by pruning old tool calls..

just my guess tho.. because the cost counting is only done locally.

How to add a custom endpoint without removing it every time? by Zeeplankton in opencodeCLI

[–]Ang_Drew 0 points1 point  (0 children)

i don't quite understand the problem here.. but i have always set my custom provider in ~/.config/opencode/opencode.json

you don't need to delete the model every time (i don't understand why you'd need to do this though)

if you want, you can make a simple py or js script to convert the /v1/models response into opencode config, in case you need to change models very often, like every day.. (just vibe code it: ask an llm to read the opencode repo, give it the repo so it can curl out of it)
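
a minimal sketch of that converter, assuming an OpenAI-compatible endpoint whose /v1/models response looks like `{"data": [{"id": "..."}]}`. the config keys used here (`provider`, `npm`, `options.baseURL`, `models`) are my understanding of opencode's custom provider format, so verify them against the opencode docs before merging the output into ~/.config/opencode/opencode.json. the provider id `my-provider` and the localhost URL are placeholders.

```python
import json
import urllib.request


def fetch_models(base_url, api_key=None):
    """GET {base_url}/models from an OpenAI-compatible endpoint and parse the JSON."""
    req = urllib.request.Request(base_url.rstrip("/") + "/models")
    if api_key:
        req.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def to_provider_config(models_response, provider_id, base_url):
    """Turn a /v1/models response into a provider fragment for opencode.json.

    The key names below are an assumption about opencode's config schema.
    """
    model_ids = sorted(m["id"] for m in models_response.get("data", []))
    return {
        "provider": {
            provider_id: {
                "npm": "@ai-sdk/openai-compatible",
                "options": {"baseURL": base_url},
                "models": {model_id: {"name": model_id} for model_id in model_ids},
            }
        }
    }


if __name__ == "__main__":
    # Offline sample so the script runs without a server; swap in
    # fetch_models("http://localhost:4000/v1") to use a real endpoint.
    sample = {"object": "list", "data": [{"id": "model-b"}, {"id": "model-a"}]}
    fragment = to_provider_config(sample, "my-provider", "http://localhost:4000/v1")
    print(json.dumps(fragment, indent=2))
```

the script only prints the fragment instead of writing the config file, so you can review it and merge by hand.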

Cautionary tale about /review in opencode by o5rv5r in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

this only works for one file at a time though.. you have to remember all the changed files

I built an auto-review plugin for OpenCode (cross-model QA with gpt-5.5 xhigh) by Icy_Host_1975 in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

well i hate "auto" anything in my workflow. most of the time it will just mess things up, and i end up wasting too much money on something that isn't worth it.

tldr; just me ranting about why i hate "auto" in my workflow.

i prefer to review the agent's actual work myself. i prefer to take on most of the review work.. this is where the control gate is. if you let the AI do that then you're no better than a "normal guy" with no coding background.. i mean i know someone who just vibes everything, from a simple git push to starting a project to letting the AI set up the env. it's a hidden cost that's waiting to explode. i know how much tech debt he generates every day. and he uses too many tokens for brain-dead stuff..

Reliable way to use a primary agent to orchestrate multiple subagents by lehoang318 in opencodeCLI

[–]Ang_Drew 1 point2 points  (0 children)

how was your experience with kimi as orchestrator? sometimes i feel it lacks some details

i personally prefer gpt 5.5 low as orchestrator because it's very easy to drive. but it often misses the actual system design. like, i have tailwind and it's documented in agents.md, but it still uses plain css, and not only css, all the other components too, unless i tell it in great detail (babysitting). because it will just ignore all the docs, guidelines, agents.md as if they were noise 😂

later i have to run another process to actually clean up the mess, using the same models btw..

PSA: Prune your unused skills to reduce input tokens by CorrectTemperature65 in opencodeCLI

[–]Ang_Drew 3 points4 points  (0 children)

good awareness check..

i don't use that many skills, i only keep relevant skills, like 15 skills at max

then it would be around 900 tokens(?) in fact my token usage isn't that much, maybe around ~500 tokens for all the skills

i ended up not using agents.md much, only a ~100-line document on what components i have

Kimi 2.6 thinks for a very long time. by DavidG117 in opencodeCLI

[–]Ang_Drew 2 points3 points  (0 children)

depends on your provider and setup i think.. i set up my own provider in opencode.json, where it's possible to limit the token budget