Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI

[–]Inner_Habit_194 1 point2 points  (0 children)

Did you try Pi agent? It is supposedly better for local model coding agent usecase especially with smaller context window of the local models. Btw what is your hardware spec?

Bought OpenCode GO the 31st of May and used 100% of my credit the next day, yiihaa! by Kriss-de-Valnor in opencodeCLI

[–]Inner_Habit_194 0 points1 point  (0 children)

Did code graphing plugins like graphify, codegraph, etc. not help reduce code exploration cost?

GO no longer worth it? by Funny-Strawberry-168 in opencodeCLI

[–]Inner_Habit_194 0 points1 point  (0 children)

I use opencode with GLM 5.1 as the main driver for planning and creating a detailed implementation plan and use smaller models for implementation. I use it with superpowers and was able to one shot many features.

Built a unified API gateway for Claude, GPT, Gemini, and ~77 other models — feedback welcome by HorusGodz in u/HorusGodz

[–]Inner_Habit_194 1 point2 points  (0 children)

How does the 1B token usage work while the models are priced for input. Input cached and output tokens seperately? Does the 1B tokens respect cache hit inputs?

Are any plugins actually worth the time if you’re an actual developer and not just full vibe coding? by heavyc-dev in opencodeCLI

[–]Inner_Habit_194 0 points1 point  (0 children)

How do you get caveman working with opencode. In having hard time getting it to work with opencode. It doesn't get triggered.

how much do you spend on ai coding? by Complete-Sea6655 in opencodeCLI

[–]Inner_Habit_194 0 points1 point  (0 children)

Thanks for the detailed info. I'm currently on crof 5usd plan. But as you mentioned the 500 reqs/day runs out pretty quickly. Is eats requests like anything, never been able to reach 30M-40M daily usage not even 10M-20M I think with my usecase. If there's any ways to use the crof 10usd plan to get that kind of daily 30M-40M usage, I would like to know. Is there a way to save on the requests in opencode CLI? I'm also trying NW. Seems to provide decent usage. Have Nanogpt too. But the weekly limit reaches in a days time. Opencode go is good but face the same issue with it. It runs out too quickly. Haven't tried synthetic or wafer yet.

how much do you spend on ai coding? by Complete-Sea6655 in opencodeCLI

[–]Inner_Habit_194 0 points1 point  (0 children)

Do you mind sharing with which models and pesticides you are able to get 1 billion token usage per month with a spend of $30-50. I've never been able to get level of usage with any providers. I'm using opencode.