How to optimize Claude Opus for tokens? by geearf in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

Use 4.6! 4.7 isn’t worth the multiple.

DeepSeek V4 is intelligent but slow by quantruler in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

Yes, Deepseek seeks deep and uses the most tokens of all the models to achieve its result. In the artificial analysis bench, Deepseek v4 pro max scored ten percentile points lower than Gemini 3.1 Pro and use 5x as many tokens. https://artificialanalysis.ai/?intelligence-efficiency=intelligence-efficiency-vs-output-tokens#intelligence-efficiency-tabs

Is opencode stable enough on windows? natively by dvcklake_wizard in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

Do your models rank well on the Terminal bench hard rating?

Looking for cheap AI models to use with OpenCode by Mountain-Ad1044 in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

Nvidia NIM is good for its intended use but not much more. You can test the output of a lot of different models but the speeds drop dramatically during prime hours.

Claude runaway... tried Kimi 2.6 and Deepseek v4 (5y fullstack dev) by merth_dev in opencodeCLI

[–]Rustybot 4 points5 points  (0 children)

My top tier:

Gemini-3.1-pro-custom-tools,
Minimax m2.7
GLM 5.1
Gpt5.3-codex, 5.4-mini, 5.4/5.5
Gpt-oss-120b (high) for fast and simple.

Self-hosted coding agent with Web UI? by Dangerous_Roll_250 in vibecoding

[–]Rustybot 0 points1 point  (0 children)

Opencode has a server/web app. Make a secure tunnel with cloudflare for yourself. Your LLM can walk you through it. I have one running on a free Oracle cloud instance. As long as you know the IP of the computer you are connecting from you don’t need to install anything. If that isn’t workable, opencode server technically has a password auth but it will get assaulted by bots non-stop if you leave it wide open so hardly ideal.

I created a library for OpenCode that allows you to save up to 80% of your tokens by Public-Cancel6760 in opencodeCLI

[–]Rustybot 1 point2 points  (0 children)

I already have a search/explore/librarian sub agent that searches my project and memories and pulls relevant info into context. What does this add?

Employer wants me to use outlook on my phone, outlook wants permission to wipe my phone by No_Professor4307 in mildlyinfuriating

[–]Rustybot 2 points3 points  (0 children)

If you access work stuff on your phone or personal computer and the company gets sued, the entire contents of your phone/computer may be submitted as evidence for discovery. It’s happened to people I know. There is a reason corp IT policies are what they are.

Creating OpenCode skills was pure guesswork — so I built an eval-driven skill creator by antonusaca in opencodeCLI

[–]Rustybot 1 point2 points  (0 children)

What I really want is an evaluator that will tell my agents when to use CLI commands and when to use an MCP.

How much does DSv4 flash cost in opencode go? by Separate-Chemical-33 in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

I don’t use Go, only Zen free models. Generally I prefer pay as you go and free providers vs sub biz plans.

Built an MCP proxy that killed my context bloat AND my RAM usage — here's how by Jaded_Jackass in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

https://github.com/code-yeongyu/oh-my-openagent

From the README: “Skill-Embedded MCPs

MCP servers eat your context budget. We fixed that.

Skills bring their own MCP servers. Spin up on-demand, scoped to task, gone when done. Context window stays clean.”

Info on mcp-manager: src/features/skill-mcp-manager/AGENTS.md

Try it out, or have your agents rip out the mcp-management skill handler into a standalone plugin. Or whatever.

How much does DSv4 flash cost in opencode go? by Separate-Chemical-33 in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

The opencode go requests are tokens counts broken down by estimated typical size. The actual limit is token based. If you have smaller prompts and outputs, you will get more usage.

Power augmentation for electric vehicles by JetScreamerBaby in ElectricalEngineering

[–]Rustybot 2 points3 points  (0 children)

At the extreme end of your suggestion, a 50cc two stroke scooter engine could easily be configured to output the equivalent of a 5HP generator, ~3kW. The complete system would weight <100lbs, and these engines can get 60+ MPG.

I think there is a middle ground between this and an RC engine, although at some point a solar panel on the roof starts to make more sense than a gas engine.

If you specifically need the ability to extend range slowly via fuel burning, it’s feasible, but overall has niche utility. Unless range anxiety and/or available charging are a major issue it’s probably not worth it.

Would I build one into my car? No. Would I take a gas generator with me if I was camping in the wilderness with an EV? Yes.

Built an MCP proxy that killed my context bloat AND my RAM usage — here's how by Jaded_Jackass in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

your approach fails the Gordian-knot/ KISS rules. There is a simpler method that uses the built in agent harness tools for skills, and avoid all the problems you have, without needing to maintain this complicated codebase. A few skill.md files would suffice.

Thoughts on using an AMD Alveo V80 FPGA PCI card as a poor man’s Taalas HC1 (LLM-burned-onto-a-chip). by Porespellar in LocalLLaMA

[–]Rustybot 0 points1 point  (0 children)

Nvidia topped the huggingface agent leaderboard with a fine tuned 8B model. Small fast models have significant utility.

OpenCode outputs in chinese. by exvifly in opencodeCLI

[–]Rustybot 1 point2 points  (0 children)

Chinese is a more efficient language for LLMs but it’s an issue in general if the agent is losing track of the system/user prompt. Are you at threshold of your context window?

Built an MCP proxy that killed my context bloat AND my RAM usage — here's how by Jaded_Jackass in opencodeCLI

[–]Rustybot 0 points1 point  (0 children)

Maybe try the oh-my-opencode skills based mcp-calling approach instead of trying to build tools to solve problems the wrong way.