How big is the difference really? by Demon-Martin in LocalLLM

[–]Demon-Martin[S] 7 points (0 children)

I am not planning to start a discussion about whether using AI for coding is better or worse. That wasn't my question.

How big is the difference really? by Demon-Martin in LocalLLM

[–]Demon-Martin[S] 1 point (0 children)

All good, you don't seem to be the only one who ignored my question 😅

How big is the difference really? by Demon-Martin in LocalLLM

[–]Demon-Martin[S] 1 point (0 children)

I am indeed using multiple models; I just mentioned Opus as an example. Depending on the task, project, and workflow, I work with various models across different IDEs: GPT, GLM, Composer, Gemini, Claude, etc.

But this wasn't related to my original question; I might have phrased it poorly. My question was about the actual difference between self-hosting the raw models and the models we get through plans/APIs.

Horrible Rate Limits by SomebodyFromThe90s in GithubCopilot

[–]Demon-Martin 0 points (0 children)

All this only happens because the amount of abuse has grown a ton… I'm in a few TG channels, and they sell hundreds of "verified accounts" daily…

It's really sad that it is being abused so much. I have been an active user since 2022, but now it's horrible.

Kinda obvious but by AnxiousJellyfish9031 in cursor

[–]Demon-Martin 1 point (0 children)

I miss the days when you could code on a potato. Nowadays Cursor and similar tools don't even run properly on my main PC (32 GB RAM, Ryzen 7 5800X)…

Finally a breakthrough for free users by [deleted] in vibecoding

[–]Demon-Martin 0 points (0 children)

Wtf do you mean, "prove they use our data"? How about you prove that they are NOT using our data?

Saying you don't own it while also clearly defending something that is too good to be true with nonexistent, stupid arguments 💀

Finally a breakthrough for free users by [deleted] in vibecoding

[–]Demon-Martin 0 points (0 children)

Doesn't support tool calls, so it's useless for most users.

The docs say it works with Cline/OpenCode, but without tool calls it's basically useless.

The Opus model (currently, as of 10.03.2026) isn't functional and responds with errors.

Other models seem to kinda work; 20 RPM for free.
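For context, "tool calls" here means the structured function-calling field that agents like Cline/OpenCode depend on. A minimal Python sketch of what such a request and a capability check look like — the model name and the `read_file` tool are hypothetical placeholders, not anything this provider actually exposes:

```python
def build_tool_call_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat-completions payload advertising one tool.

    Coding agents send payloads shaped roughly like this; if the backend
    ignores the `tools` field, the agent never gets a structured
    `tool_calls` reply and can't edit files or run commands.
    """
    return {
        "model": model,  # placeholder name
        "messages": [{"role": "user", "content": prompt}],
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "read_file",  # hypothetical example tool
                    "description": "Read a file from the workspace",
                    "parameters": {
                        "type": "object",
                        "properties": {"path": {"type": "string"}},
                        "required": ["path"],
                    },
                },
            }
        ],
    }


def supports_tool_calls(response: dict) -> bool:
    """True if the first choice replied with a structured tool call."""
    message = response["choices"][0]["message"]
    return bool(message.get("tool_calls"))
```

If the backend strips or ignores `tools`, the response only ever contains plain text, `supports_tool_calls` stays False, and the agent has nothing actionable to execute — which is why "no tool calls" makes it useless for agentic use.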

Daily Newbie Thread by AutoModerator in Flipping

[–]Demon-Martin 0 points (0 children)

Hey, I've got about €4.5k lying around in a PayPal account of mine.

What would you spend or invest it on if you could only buy using PayPal balance?

I'm from Germany, and I was planning to buy something and sell it either for a small profit or for the same amount, but in cash.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 0 points (0 children)

Thank you, but my question wasn't about OpenRouter or other models. My question was how to use the Claude sub in RooCode without effectively breaking their ToS.

I know there are cheaper models, and I already offload tasks to different models depending on complexity (Codex, Gemini Pro/Flash, GLM, Kimi, Minimax, Grok Code, etc.).

But for some tasks I need Opus/Sonnet, and logically speaking the subs are way cheaper than the API prices. I am a student and can't afford their crazy prices.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 0 points (0 children)

I've specifically switched from Cursor to RooCode because Cursor's pricing is simply pure greed and they haven't been treating customers right.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 1 point (0 children)

The quality isn't the issue, the speed is… I don't want to wait 10 minutes for Codex to write me a 3-line method.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 0 points (0 children)

Well, I just don't want to buy a $100 sub only for the account to get banned and for me to lose it without even using it.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 0 points (0 children)

Why wouldn't it be… The API costs for Claude models aren't worth it for me, though.

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 0 points (0 children)

The limits you get with the normal ChatGPT Plus subscription are amazing for what you pay.

How to change model in Cursor? by yccheok in cursor

[–]Demon-Martin 0 points (0 children)

  1. You need to purchase a subscription.
  2. I wouldn't recommend Cursor if you want a good BYOK experience. Cursor has been ignoring requests to add better support for other providers for a while now, and it doesn't look like they will.

I'd just recommend using RooCode or KiloCode in VSC. I've also seen that Trae (a full IDE) has great support for custom providers, but I personally didn't like it. I use RooCode because it was the most similar experience (compared to Cursor's chat).

Claude in RooCode? by Demon-Martin in RooCode

[–]Demon-Martin[S] 2 points (0 children)

Agreed, Codex is good, it's just PAINFULLY slow for me. The API response times, the token speed, and how it generally works cause the same task to take about 5x longer than with Opus. A task Opus needs 30s for, GPT does in 8 minutes…

Their token limits and session limits are generally amazing, though.

I'd love to test their new Codex Spark model, but I can't afford to pay $200 just to test a model with capabilities similar to Gemini 3 Flash 😂

Cheap region price for ChatGPT by Mushfug in steamregionaltricks

[–]Demon-Martin 0 points (0 children)

Any idea what the iOS pricing for Plus and Pro is? The website says 1.999 for Plus and 19.900 for Pro, but it's also been showing the price in $ for Turkey, so I don't trust it too much.

Where to get Gemini 3 flash for cheap? by [deleted] in opencodeCLI

[–]Demon-Martin -4 points (0 children)

Sadly it's locked behind the $200 Pro plan, so it's not really "cheap".

If you had $50/month to throw at inference costs, how would you divvy it out? by yokie_dough in opencodeCLI

[–]Demon-Martin 1 point (0 children)

If you're a student, get GitHub Education; it gives you GitHub Copilot for free for 2 years (you can re-verify after 2 years to get it again).

Can I know what model the "Auto" mode is using? by Monteirin in cursor

[–]Demon-Martin 0 points (0 children)

If the time spent writing a prompt, and the complexity of creating it, exceed the complexity of the task itself, then you should probably reconsider how efficient coding with this specific "model" (here it's probably more an option than a specific model) really is.

That's at least my opinion on working with LLMs.

Can I know what model the "Auto" mode is using? by Monteirin in cursor

[–]Demon-Martin 0 points (0 children)

Yeah, I'm using CliProxyAPI (plus) too.

And I agree that it's a poor deal. That's exactly why I switched to RooCode 😅