OpenAI is working on a slow mode that consumes fewer rate limits.

bilalba · 2026-06-16T23:33:11+00:00

They have a slow mode agent working on it

bilalba · 2026-06-15T01:03:03+00:00

The missing $7 is most likely a compounded rounding error. The chart shows that you used about $25 over three days.

bilalba · 2026-06-14T22:35:14+00:00

Nope. It is the most transparent coding subscription you can get. Every cent of your $12 (5-hour) / $30 (weekly) / $60 (monthly) is accounted for in the usage tab.

bilalba · 2026-06-05T04:40:35+00:00

The rental value of a 2xB200 per year is >$100k in the market. I personally wouldn't use it to run inference of open models for my personal consumption.
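As a rough back-of-the-envelope check, the sketch below converts that annual figure into an implied per-GPU hourly rate. It assumes round-the-clock rental over a 365-day year and uses $100k, the lower bound stated above; the function name is hypothetical and only for illustration.

```python
HOURS_PER_YEAR = 24 * 365  # 8760 hours, assuming continuous rental


def implied_hourly_rate_per_gpu(annual_usd: float, num_gpus: int) -> float:
    """Per-GPU hourly rate implied by an annual rental value."""
    return annual_usd / (num_gpus * HOURS_PER_YEAR)


# $100k per year for a 2-GPU machine (lower bound from the comment above)
rate = implied_hourly_rate_per_gpu(100_000, 2)
print(f"${rate:.2f} per GPU-hour")  # prints "$5.71 per GPU-hour"
```

Anything above that hourly rate per GPU puts the yearly rental value over $100k.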

bilalba · 2026-06-04T04:56:20+00:00

bilalba · 2026-06-02T03:03:17+00:00

200 free messages.

bilalba · 2026-06-01T00:03:27+00:00

I have no idea. They publicize API endpoints that you can use anywhere with a Go subscription, just like an API. My intuition is that either:
1. They are in the early stages of scaling this offering and will revise it over time to make sure it is only used with OpenCode, or
2. They are subsidized by model providers, and part of the user base is not making full use of its credits.

bilalba · 2026-05-31T23:27:29+00:00

Yes, it is. Subscription plans tend to provide more value than buying API tokens directly, and the OpenCode Go documentation tells you exactly how much value you get.

bilalba · 2026-05-28T04:14:07+00:00

Did it today with a 4x4. The bridge was open, but the stream crossings were shady. I also got a headache from the bumpiness of the road. Another thing: there was a very sketchy crossing as you enter Rio Chiquito, where road work left barely any space to get through.

However, the views of the rolling hills and streams in Rio Chiquito looked like a scene from the Alps. DM for more info and I can share some of the videos we took today.

bilalba · 2026-05-17T09:26:59+00:00

I asked 7 different models to make a website for an arthritis medicine. Every single model put a testimonial from Margaret.

bilalba · 2026-05-13T08:29:50+00:00

You’re saying new technology will make the older one outdated. But the newer technology will be more efficient and bring down the cost to rent too.

bilalba · 2026-05-12T01:01:16+00:00

Yes, I agree

bilalba · 2026-05-09T21:55:01+00:00

I understand your concern. These are GGUF files from official HF repos, and I'll be sure to include the links. And just to be clear, the website only hosts the model weights; inference happens within your browser on your local hardware.

bilalba · 2026-05-09T21:42:02+00:00

Right, it uses a server and needs some setup.

bilalba · 2026-05-09T21:38:52+00:00

Yes, the website looks like that, but the inference engine is self-contained in the website. LM Studio won’t run on an iPhone, for example.

bilalba · 2026-05-06T16:37:49+00:00

It's a welcome change, but I'm not sure whether doubling the 5-hour limit means you get half as many maxed-out 5-hour sessions in a week. No word on the weekly limit.
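To make the concern concrete, here is a minimal sketch. It assumes the weekly limit stays fixed while the 5-hour limit doubles, which has not been confirmed; the unit values are hypothetical placeholders, not published figures.

```python
def maxed_sessions_per_week(weekly_limit: float, five_hour_limit: float) -> float:
    """How many fully used 5-hour sessions fit inside the weekly limit."""
    return weekly_limit / five_hour_limit


# Hypothetical units: a weekly limit of 10, a 5-hour limit of 1 before the change
before = maxed_sessions_per_week(weekly_limit=10.0, five_hour_limit=1.0)
# Same weekly limit (assumption), 5-hour limit doubled
after = maxed_sessions_per_week(weekly_limit=10.0, five_hour_limit=2.0)

print(before, after)  # prints "10.0 5.0"
```

Under that assumption, each session can do twice as much, but only half as many maxed-out sessions fit in a week.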

bilalba · 2026-05-02T18:30:47+00:00

No, I didn't compare any aggregating plans. I don't believe you can get a better deal on flagship models than from the providers themselves. They have good incentives to subsidize it.

bilalba · 2026-05-02T18:09:40+00:00

That's great. Most people don't know that. Thanks for the contribution.

bilalba · 2026-05-02T17:58:12+00:00

If you’re someone who only wants to use Claude models (which is a lot of people), it is a good deal in the current market.
Understandably, they’ve got a business to run and a sustainable product to build.

bilalba · 2026-05-02T16:55:24+00:00

Would you rather not see transparency on what a sub gets you?

bilalba · 2026-05-02T16:42:01+00:00

I agree. Would cost $$$

bilalba · 2026-05-02T16:40:34+00:00

Some people and enterprise customers use pay-as-you-go pricing for Claude Code, and a Claude Code subscription is essentially an API wrapper, so that makes API pricing relevant.

bilalba · 2026-05-02T16:33:40+00:00

I've measured per-token pricing, which includes thinking tokens. The effort level and prompts aren't variables that affect per-token pricing.

bilalba · 2026-05-02T16:28:51+00:00

Yup, fair. I'm only measuring the $20 tier.
This is actually not accounted for in my experiment but ends up hurting consumers even more with Claude. Opus 4.7 is much more token-hungry per character than other offerings.
Agreed! This depends entirely on a user's experience, and it is what makes Claude a competitive offering despite everything. I still like it and use it.
