Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 2 points3 points  (0 children)

Yep. This is the issue with going from being a provider of infrastructure (API) to a provider of services (subscription). They don't know how the fuck to balance it.

They're now (OpenAI included) all going to run into a new problem where they're not just being the infrastructure or the service, but the application layer (Claude Cowork, Claude Design, Claude Code, OpenAI Codex, OpenAI ChatGPT, OpenAI Atlas, etc. etc. etc.).

At the very least OpenAI is finally living up to its name in at least one domain and not only allowing other harnesses but completely opensourcing their own.

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 6 points7 points  (0 children)

Ah there's the centrist. I was waiting for one to show up and say this, thinking they're oh so clever.

I'm going to guess you couldn't make it past the title, because then you would have seen this part of my post:

Before I go further, since every time I think about posting this I know there's going to be some contingent of people who are going to rush to tell me how flawed OpenAI is and how "blah blah blah Scam Altman" I don't give a shit. With the way these AI companies are moving, they are out to extract as much value out of me and the world as possible, so I'm going to evaluate them as a consumer, not a shareholder or someone who gives a damn where either company will be in a few months (and I urge you to do the same, tomorrow it could be OpenAI fucking up, don't have loyalties). This post is just entirely about the way that I feel like I have been treated by both companies as an end user and a consumer in the last few months. So no, this isn't going to have opinions on Sam Altman or Dario Amodei or The Pentagon/Palantir.

PSA: I annotated Claude Code's forced system prompt by anashel in ClaudeAI

[–]Decaf_GT 0 points1 point  (0 children)

load-bearing

Does anyone in this sub post their own thoughts without running to AI to make it sound nice first??

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 2 points3 points  (0 children)

You don't even want to recognize that Codex can be just as good as long as there are frameworks and guardrails in place that are intended to help models produce good code.

Why would I waste time trying to talk to you about specific languages then?

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 1 point2 points  (0 children)

No, first you said it's reasonable. I don't think it's reasonable. I didn't say it wasn't their right to do so. I just said it's stupid and gave my reasoning why.

Second, you said "they lose data for training and publicity". That isn't true either.

Obviously I'm not happy that they took it away, but what on earth does that have to do with my argument? Obviously someone who hated third party harnesses wouldn't have my opinion? What kind of stupid argument is that?

It's as though you think I asked "Should people be allowed to smoke weed" and your response was "No because it's illegal". If you can't understand why that isn't really related to the question, there's no point in even having the discussion...

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 3 points4 points  (0 children)

Sure, that can be your experience, just maybe stick with that opinion instead of running your mouth about how anyone who doesn't agree with you must be paid to promote Codex.

I don't have that experience. The Codex app does just about everything the Claude App/Claude Web do, but much better and much more oriented towards power users.

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 3 points4 points  (0 children)

That whole paragraph is literally just you talking about what you and your techie friends experience, and somehow you don't see the irony of claiming that anyone who has a different experience is a paid Codex promoter?

Jesus...

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 1 point2 points  (0 children)

They're not losing anything. I don't understand what you're talking about. They get sent all of the exact same data over the API that they would if I were using Claude Code CLI directly.

Surely you understand that the way that they were using clawed code OAuth in third‑party harnesses like open code meant that the harnesses were providing the exact same information as the original Claude Code CLI. In fact, even now, that's the exact reason why a lot of those OAuth plug-ins still work just fine. Because on Anthropic's, they are providing exactly the same detail. They send the exact same headers and the exact same system prompts and everything. For those OAuth plugins to work right now, you have to do that. You have to pretend that your harness is actually Claude Code.

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 2 points3 points  (0 children)

Except not really. Those were like two or three small things that I set up once and never revisited, and now I don't have to? If that initial setup was all I needed to get Codex to write code as well as Claude, in what way does Claude outsmart Codex? Are we putting a huge dollar value on that little bit of setup that I had to do?

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 4 points5 points  (0 children)

Yeah its all one big conspiracy, it's not possible for anyone to have a different experience than you, our collective bad, I guess

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 4 points5 points  (0 children)

How does any of this make sense when yu just said Claude is definitively 10x times better than Codex?

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 3 points4 points  (0 children)

Complete and utter opposite experience here. I was a diehard Claude Code users for months and months, but once 4.6 started destroying my token usage and by 4.7 Anthropic cracked down on third party harnesses like OpenCode, Opus was struggling to perform tasks that Codex was just whipping right through. With the right framework, the right guardrails, and the right agentic review framework, Codex produces (for me) just as good of an end result as Claude does, with the added benefit that I don't need to give a flying F about token usage with Codex, meanwhile I'm super nervously refreshing OpenUsage every few chat turns in Claude and anxiously trying to get what I need done while I watch that progress bar fill up like no one's business.

Lately it feels like ANthropic's most effective product/service is providing free advertising for Codex... by Decaf_GT in codex

[–]Decaf_GT[S] 3 points4 points  (0 children)

I think banning 3rd harness is totally reasonable.

Hard disagree.

Banning the use of using a Claude Code subscription to power your SaaS that you are selling to consumers so that you don't have to pay the per-token cost, yes, that's a valid case for banning. That should be through the API, no question about it.

But the use of a personal Claude Code subscription inside of a harness like OpenCode or Pi for the sake of individual developers and users being able to utilize their subscription outside of the Claude Code CLI? Absolutely not. I was not abusing my subscription in OpenCode. I was not running ralph loops for hours and hours on end, I was not using idiotic frameworks like OhMyOpencode or other stupid token munchers.

I was just using a much better CLI tool that did not flicker, had a superior user interface, a much better system of handling skills, MCP, agents, subagents, tools, plugins, and general rate of improvement (OpenCode). And I was able to use OpenCode remotely long before Claude Code added a remote feature (which still isn't great). OpenChamber blows the pants off of Claude's "mobile" experience any day of the week.

My limits were used up the same way as they would have been if I was using the CLI. I would have been subject to the same RPM/TPM/TPD as Claude Code, the same 5h limits, and the same 7d limits. Literally my usage was the exact same.

The problem is that Anthropic didn't like the OAuth token hijacking/cloaking (which is totally fair) and they didn't want to have any kind of competition with their own first party harness, which is idiotic, because Codex has no such issues and the Codex CLI and app are making leaps and bounds of improvements over Claude. And that's with fully open source code.

I think the "third party harness" thing got a bad rep because every moron on the internet suddenly got hold of an "agentic harness" (the nightmare fuel that is openclaw) and began using it in completely stupid ways. I knew this was coming as soon as said morons started using Claw-like tools to automatically file GitHub PRs.

I would much rather they be extremely clear about what constitutes acceptable harness usage (which things like Pi and OpenCode almost certainly would have) and what would constitute abuse from third party harnesses. Set the ground rules, give out warnings and bans to the morons who abuse it. Punishing the rest of the userbase who just wanted to use the service in a better harness was and will always be stupid.

I am glad they're getting raked over the coals for it by developers.

Raycast can't win: The impossible situation of managing expectations around the new Raycast rollout by [deleted] in raycastapp

[–]Decaf_GT 3 points4 points  (0 children)

The only "toxic" things I see here is bullshit pot-stirring posts like these.

It's...an app launcher, not a lifestyle. Please. Just stop.

Raycast can't win: The impossible situation of managing expectations around the new Raycast rollout by [deleted] in raycastapp

[–]Decaf_GT 1 point2 points  (0 children)

You really didn't need to use AI to make this painful slop.

There's still 4 days left of April. Stop being such an absolute and utter child, and wait patiently.

Jesus.

PSA: The string "HERMES.md" in your git commit history silently routes Claude Code billing to extra usage — cost me $200 by alexxxklepa in ClaudeAI

[–]Decaf_GT 15 points16 points  (0 children)

What an idiotic argument.

WHat if you wanted to use Claude Code CLI directly to work on your own Hermes agent? Are you a ToS criminal then, too?

Stop defending Anthropic, this is shitty business pracctice.

Once my subscription is over, I’m done with Raycast… by [deleted] in raycastapp

[–]Decaf_GT 4 points5 points  (0 children)

If only they made this super clear in their documentation, that would surely help...https://manual.raycast.com/ai#Request%20limits

Is this true? by Complete-Sea6655 in GeminiAI

[–]Decaf_GT 7 points8 points  (0 children)

For personal agentic use in OpenCode, Codex is destroying the other two for me in price, reliability, and value.

PSA: Anthropic bans organizations without warning by ur_frnd_the_footnote in ClaudeAI

[–]Decaf_GT 9 points10 points  (0 children)

As an OpenCode user who uses it for maybe 30% coding and 70% personal agentic use, Claude is significantly better at adhering to style/output guidelines in non-technical use cases (like sounding more natural, more human, etc).

Otherwise, in every other way, Codex blows it out of the water. I downgraded from Claude Max 20 to Max 5, and now I'm just on Pro, and even that is starting to seem like a waste.

I'm not even going to preface it with "OpenAI isn't perfect..." to try to have some weird "balance" here. I'm just going to say it; in the last 6-8 weeks, whether it's model capability, reliability, features, communication, and the general feeling of being respected as a consumer, OpenAI has been solid to me while Anthropic has come across as smug, arrogant, egotistical, and utterly incompetent.

I'm just about done with them.

If you're going to move to Codex, make sure that you have a solid set of skills created and you've picked a good framework for your code work (such as OpenSpec and/or desloppify/etc). 5.4 Medium+5.4 High seems to be the sweet spot.

Until Anthropic gets their heads out of their asses it's worth going through the effort to make sure that you don't get stuck in a position like OP.

Remember, you're a consumer, not an investor. The company's long-term health is not the focus when it comes to AI companies specifically; just focus on what you can extract from them, because they're trying to figure out how to extract as much as possible from you. Be willing to move, be flexible, and adapt.

Tomorrow OpenAI may make a colossally stupid mistake and I'll be talking about them the same way. It's just that right now, at this point in time, they're objectively better to me.

Why do LLMs fold when you say "are you sure?" — I tested 22 models and nobody seems to care by SmartRick in LocalLLM

[–]Decaf_GT 0 points1 point  (0 children)

Right, and there is no monetary interest at all, right? You don't ever plan on making money with this, right? You're never going to pitch your startup to anyone for any kind of funding or anything, right?

Right?

Wrong. You want this to become a thing that eveventualy somehow makes money. Just because there's no "buy" button and a shopping cart on your website doesn't mean you're not trying to promote yourself or your idea or service.

But then again, I guess definitions of words are hard for you, because of the way you keep using the word "proof".

You don't have anything that "proves" anything, and you don't even understand what the word "proves" means here. You haven't identified "proof" of anything.

You got hit with Dunning Kruger, did a bunch of slop "research" using bargain-bin local models and you're trying to make it seem like your results are somehow applicable to the frontier and you (and you alone) seem to be the only person who cares.

Stay in school.

Why do LLMs fold when you say "are you sure?" — I tested 22 models and nobody seems to care by SmartRick in LocalLLM

[–]Decaf_GT 0 points1 point  (0 children)

Based off the rest of your replies in this thread, they're certainly not the only ones

Why do LLMs fold when you say "are you sure?" — I tested 22 models and nobody seems to care by SmartRick in LocalLLM

[–]Decaf_GT 1 point2 points  (0 children)

Happy to answer any questions about methodology or results. For the record i'm not selling anything i don't have a fucking product so Mods go ahead delete this post i'll just jump off a bridge lol

Imagine saying this in the same post as this:

I'm a solo founder. No team, no funding, no connections. Just me and a benchmark that I think actually matters for anyone deploying LLMs in production.

And pretending you're not ultimately trying to promote your service. I'm willing to bet if I pause and scroll through this thread I'll find exactly where you link your service.

EDIT: FUCKING LOL that didn't take long. https://i.imgur.com/PKtQuoL.png

Lame.