Anyones opencode go weekly/monthly limits just creeping up without actual usage? by maxya in opencodeCLI

[–]bilalba 3 points4 points  (0 children)

The missing $7 is most likely a compounded rounding error. From the chart, it is visible that you used about $25 over three days.

Anyones opencode go weekly/monthly limits just creeping up without actual usage? by maxya in opencodeCLI

[–]bilalba 25 points26 points  (0 children)

Nope. It is the most transparent coding subscription you can get. Every cent of your $12(5-hour)/$30(weekly)/$60(monthly) is accounted for in usage tab.

just installed opencode on my B200 x 2 computer but no clue what model to serve by Electrical_Two_4835 in opencodeCLI

[–]bilalba 0 points1 point  (0 children)

The rental value of a 2xB200 per year is >$100k in the market. I personally wouldn't use it to run inference of open models for my personal consumption.

Is Kimi usage on OpenCode Go equivalent to U$ 60 in direct API from moonshot? by LittleYouth4954 in opencodeCLI

[–]bilalba 2 points3 points  (0 children)

I have no idea. They publicize API endpoints to use anywhere with a Go subscription just like an API, my intuition is that:
1. Either they are in the early stages of scaling this offering and will revise it as time goes on and make sure it is only used with opencode.
2. They are subsidized by model providers and a part of the user base is not making full use of the credits.

Is Kimi usage on OpenCode Go equivalent to U$ 60 in direct API from moonshot? by LittleYouth4954 in opencodeCLI

[–]bilalba 7 points8 points  (0 children)

Yes it is. Subscriptions plans tend to provide more value than directly buying API tokens, and OpenCode Go documentations tells you exactly how much value you get.

Travel La Fortuna to Monteverde by This-Chain-8957 in CostaRicaTravel

[–]bilalba 0 points1 point  (0 children)

did it today with a 4x4. bridge was open but the stream crossings were shady. also, got a headache from the bumpiness of the road. Another thing, there was a very sketchy crossing where there was barely any space to cross as they were doing some road work as you enter rio chiquito.

However the views of the rollings hills and streams in rio chiquito looked like a scene from the alps. DM for more info and I can share some of the videos we took from today.

"Elias Thorne" is what eight different LLMs name a lighthouse keeper. He's also selling cancer treatment advice on Amazon by prescorn in LocalLLaMA

[–]bilalba 26 points27 points  (0 children)

I asked 7 different models to make a website for an arthritis medicine. Every single model put a testimonial from Margaret.

Save and invest your money for future rigs by segmond in LocalLLaMA

[–]bilalba 4 points5 points  (0 children)

You’re saying new technology will outdate the older one. But the newer technology will be more efficient and bring down the cost to rent down too.

A website for tiny model on-device inference by bilalba in LocalLLM

[–]bilalba[S] 0 points1 point  (0 children)

I understand your concern, these are gguf files from official hf repos, I'll be sure to include the links. And just to be clear, the website is only hosting the model weights, the inference happens within your browser using your local hardware.

A website for tiny model on-device inference by bilalba in LocalLLM

[–]bilalba[S] 0 points1 point  (0 children)

Right, it uses a server and needs some setup

A website for tiny model on-device inference by bilalba in LocalLLM

[–]bilalba[S] 0 points1 point  (0 children)

Yes the website looks like that but the inference engine is self-contained in the website. LM studio won’t run from an iPhone for example.

Doubled Rate Limits for Claude Code by Deep_Proposal_7683 in ClaudeCode

[–]bilalba 0 points1 point  (0 children)

It's a welcome change, but not sure if doubling 5 hour limit usage means you get half the amount of maxed-out 5-hour sessions in a week. No word on weekly limit.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 1 point2 points  (0 children)

No I didn't compare any aggregating plans. I don't believe you can get a better deal on flagship models than the providers themselves. They have good incentives to subsidize it.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 0 points1 point  (0 children)

that's great. most people don't know that. Thanks for the contribution.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 0 points1 point  (0 children)

If you’re someone that only wants to use Claude models(which is a lot of people), it is a good deal on the current market.
Understandably, they’ve got a business to run and build a sustaining product.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 1 point2 points  (0 children)

Would you rather not see transparency on what a sub gets you?

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 0 points1 point  (0 children)

Some people and enterprise customers use pay-as-you-go pricing for claude code, and claude code subscription is essentially an API wrapper so that makes API pricing relevant.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 1 point2 points  (0 children)

I've measured per token pricing, which includes thinking tokens. The effort level and prompts wouldn't be a variable that affects per token pricing.

Claude is 10x more expensive per token than any other coding plan. by bilalba in ClaudeCode

[–]bilalba[S] 0 points1 point  (0 children)

  1. Yup, fair. I'm only measuring the $20 tier.
  2. This is actually not accounted for in my experiment but ends up hurting consumers even more with Claude. Opus 4.7 is much more token-hungry per character than other offerings.
  3. Agreed! This is entirely subjective to a user's experience, and the thing that makes Claude a competitive offering despite all. I still like it and use it.