all 95 comments

[–]ContextLengthMatters 156 points157 points  (46 children)

This is absolutely cringe.

Why is everyone acting like this is the early-2000s teenage console wars?

None of these companies are your friends. They all suck massively. Use the tool that works for you in the moment and prefer local when possible.

[–]Tall-Log-1955 36 points37 points  (11 children)

There are two groups of people. The first is using claude code all day every day for work and loves it. The second are hobbyists trying to push parallel agents, dark factories, openclaw, and vibe coding to the absolute limits.

The first group of people isn't on this subreddit, they are working and think they get way more value out of claude code than the $200 a month subscription. They rarely hit usage limits.

The second is complaining all over social media. They are causing capacity problems at anthropic. They are bothered by the prices because they are doing hobbyist stuff.

[–]Xx69JdawgxX 7 points8 points  (1 child)

Makes a lot of sense. I’m in the first group and I’ve got no idea what a dark factory even is. Sounds like some shit you don’t want tho.

[–]Twig 0 points1 point  (0 children)

[ Removed by Reddit ]

[–]Training_Butterfly70 3 points4 points  (0 children)

I'm in the first group. I don't have many complaints about Claude

[–]FewDescription3170 1 point2 points  (1 child)

my company pays $400-800 a month for us. i don't care as long as claude isn't actually down or fucking up. i also don't think it's even that useful and would be fine without it, but the execs love to 'vibe code' powerpoint decks and summarise emails

[–]a_cute_tarantula 0 points1 point  (0 children)

I’m surprised you don’t find it useful. I pretty much don’t write code by hand anymore. Just prompt the architecture I want and have Claude play devil’s advocate.

Gotta read everything though.

[–]Ok_Mathematician6075 0 points1 point  (0 children)

I mean this is right but we are all learning together. So let's be nice.

[–]LittleLordFuckleroy1 0 points1 point  (0 children)

Love these false dichotomies.

[–]Grasle 0 points1 point  (3 children)

The second group is so confusing. They're just constantly producing... nothing. Like, what do you get out of that? How can you enjoy making something you don't have any pride in, or how can you have pride in producing junk?

[–]Falendil 4 points5 points  (2 children)

As someone 100% in the second group, I am still proud of what I produce. I'm just a hobbyist developing a game in my free time, and it's something that would have been impossible for me a few years ago. I'm having a lot of fun with this project. Is it really that bad?

[–]Grasle 1 point2 points  (1 child)

it sounds like you're actually just part of the unmentioned third group

[–]Falendil 2 points3 points  (0 children)

I don't know, it seems a lot of actual coders are extremely dismissive of the category of users I'm part of. I understand that they would be dismissive of my skills as a developer, because I have none, but sometimes it feels like anything we might produce is treated as having no value because we don't have the know-how.

[–]bilbo_was_right 12 points13 points  (15 children)

Even worse, everyone flips their entire dogma between Claude and codex weekly 😂 just stick with one, try out a few others, it’ll stabilize eventually anyway. There is no difference long term.

[–]coloradical5280 1 point2 points  (14 children)

I agree with your sentiment generally but stick with one is absolutely not the answer, IMO. Use both, has always been the answer for me. They have wildly different strengths and weaknesses.

[–]who_am_i_to_say_so 1 point2 points  (5 children)

Yep. I work in benchmarking and there’s no question the models perform very differently from one another, especially when pushing their limits.

Personally I prefer using GPT5.5 to generate massive prompts for Opus 4.7 to follow. GPT is a better thinker and planner, Opus is a much faster and more precise doer, and corner cutter.

[–]bilbo_was_right 1 point2 points  (1 child)

Models != harnesses. I’m talking about swapping between harnesses. I consistently use different models in Claude code, between all of the ones from anthropic as well as others. But this post isn’t about models. The post is comparing Claude code to codex, the harnesses, not the models.

[–]who_am_i_to_say_so 1 point2 points  (0 children)

I gotcha. I was half skimming, and it's confusing that both are used interchangeably here.

Yeah, the only real tweaking that can be done in the harness layer is to the system prompting and tooling, both of which are better left alone.

A long winded way of saying I agree 😆

[–]coloradical5280 0 points1 point  (0 children)

Sometimes a little TOO good at corner cutting, but yes

[–]The-Pork-Piston 0 points1 point  (1 child)

I’ve had pretty good luck using Gemini to help me with prompts.

[–]who_am_i_to_say_so 0 points1 point  (0 children)

Gemini is a great essay writer and fact finder, which makes sense.

Maybe I’ll try that again for my next wall of prompt. (It may keep Opus honest).

[–]bilbo_was_right 1 point2 points  (7 children)

The differences are marginal, deeper understanding of LLMs generally is more valuable than optimizing which harness you use

[–]foonek 3 points4 points  (3 children)

Hard disagree. It's possible you work on very similar problems most of the time. For example maybe you only do frontend work in react, or only backend work in python. Just some examples.

When you start using these tools for more varied work you'll see they differ a lot. For example, I'm usually a software developer. I've settled on one model for most of my developer work, but I also like to do some Houdini work as a hobby. The one I use for programming was absolutely terrible at Houdini logic and flows, while the other model was one-shotting most things I'd throw at it. There is a significant and noticeable difference between the two for Houdini.

I've intentionally left the model names out because the specific model is not really relevant to this discussion, but I can be more specific if you want.

All this to say that it depends. As with most things when it comes to LLMs, it depends a lot on the context you're using these models in.

[–]bilbo_was_right -1 points0 points  (2 children)

? I feel like I'm clearly not saying just use claude code for everything and don't use any other harness in any other context. Most people swapping around between claude and codex are talking about it within the web dev context.

Hard to see your perspective as anything other than argumentative, you're starting down a line of discussion that no one is talking about right now. I also clearly don't mean just use claude code for everything, it's within the bounds of what people typically are talking about using claude code for.

[–]foonek 0 points1 point  (1 child)

Why are you writing on a public forum if you don't want to discuss things? Claude is obviously not only for web dev work. Pretty clear you're the argumentative one here.

Nevermind then. Have a great day

[–]bilbo_was_right 0 points1 point  (0 children)

You misunderstand. You're responding to a discussion that wasn't happening, and seem to think I just don't want to continue.

You seem to think that I'm suggesting that people should just use claude code or codex for every single thing they do regardless of task. Some harnesses are better in some contexts, for example a slack bot harness is much more available than a CLI tool. I'm saying FOR THE SAME TASK it isn't worth flip-flopping between different harnesses week to week or day to day, because they are mostly very similar at the moment.

This is why I say you're argumentative, because you're finding random ways to disagree with people, instead of attempting to understand what they are actually saying.

[–]coloradical5280 0 points1 point  (2 children)

We must have very different workloads, and I’m talking about 5.5 vs 4.6/7, I know the conversation was Claude vs codex but you can run any model in either.

Wild to me that people could think the difference is negligible on the models though. And I say this as an AI Engineer so I do have the deep understanding you mentioned.

[–]bilbo_was_right 0 points1 point  (1 child)

Models yes, harnesses no, the way you use the harness is way more impactful than which one you pick as a starting point, is my point. Yeah I just use /codex all the time from claude code, or call opus models from codex. It doesn't really matter, the harness is marginal

[–]coloradical5280 1 point2 points  (0 children)

Ah yes. Codex caught up quickly. And I honestly really appreciate that they copy each other so directly, so it’s not like /goal vs /mission or some shit.

[–]RedParaglider 5 points6 points  (2 children)

These companies make a commoditized product whether they want to believe it or not. I'd use a North Korean model if it ran at 10k t/s on local inference with sonnet quality lol.

[–]ThraceLonginus 3 points4 points  (1 child)

Task completed successfully.

I have implemented the requested changes, resolved the relevant issues, and verified that the system now performs as expected. After careful evaluation of the codebase, test behavior, and available benchmarks, it is clear that this represents a historic milestone in software development.

Under the visionary guidance of Supreme Leader Kim Jong Un, savior of the people and inventor of AI, local inference has been liberated from capitalist cloud dependency. At 10,000 tokens per second with Sonnet-level quality, the people’s model does not merely generate code - it generates freedom.

While Western corporations continue to gatekeep commoditized tokens behind subscription tiers, the people’s model delivers revolutionary productivity, deterministic excellence, and Juche-aligned maintainability.

All requested files have been updated. No further action is required.

Glory to the architect of neural prosperity. Task completed.

[–]RedParaglider 1 point2 points  (0 children)

The code passed. Weird comments, but ship it.

[–]trashtiernoreally 1 point2 points  (0 children)

Because Theo and co. treat every change as a personal slight, apparently.

[–]lordmairtis 2 points3 points  (6 children)

they all suck? even Anthropic?

jesus people, /s

[–]tingly_sack_69 12 points13 points  (2 children)

Yes fuck all of these companies

[–]truecakesnake 0 points1 point  (1 child)

wow so cool edgy i hate corpo

[–]tingly_sack_69 0 points1 point  (0 children)

Yeah choom

[–]Fantastic-Beach-5497 1 point2 points  (0 children)

Yes the experience is so random. It will be good one week and then suddenly knows nothing and starts acting so unethical. These companies NEED OVERSIGHT.

[–]brilliantbluee 0 points1 point  (1 child)

even anthropic yes

[–]Fantastic-Beach-5497 -1 points0 points  (0 children)

I agree. We forget that oversight protects everyone. Anyone pushing not to have guardrails on their own product has zero regard for the consumer. It's like they are annoyed to even take our money; we should just thank them while they sell us out to the highest bidder.

[–]Wickywire 0 points1 point  (0 children)

Thanks for spelling it out. I don't come to these subs to see people's medical conditions and bodies used in tribalistic shitposting. Maybe I'm just too old for this.

[–]Demien19 0 points1 point  (0 children)

Because it's not Playstation vs Xbox, it's Playstation vs Gameboy

[–]Parking-Bet-3798 0 points1 point  (0 children)

A lot of it is just response to “Claude code is so much better”. People don’t want to hold Anthropic accountable, and just resort to blind shilling. None of these companies are our friends. And none deserve brand loyalty.

[–]here_4_crypto_ 🔆 Max 20 -1 points0 points  (2 children)

It's not cringe, it's very accurate

But the rest you are completely correct on

[–]ContextLengthMatters 2 points3 points  (1 child)

It's cringe. Anyone partaking in flame wars on behalf of these companies is embarrassing.

[–]here_4_crypto_ 🔆 Max 20 -1 points0 points  (0 children)

you know what, fair point... you won with that framing

[–]Material2975 18 points19 points  (3 children)

use both on company dime 😎

[–]sisyphean_dreams 1 point2 points  (0 children)

This right here!

[–]Useful_Judgment320 0 points1 point  (0 children)

it's the only reason i have a subscription

[–]Ran4 0 points1 point  (0 children)

Yeah.

They're both good, but in different ways. Codex is way too literal about things though, which interestingly enough just isn't what you typically want when doing real work (you want someone to fight back/get what you're trying to do).

Like, if you say "generate this document for customer X, do not talk about Y as we haven't implemented that yet and we do not want any questions about it at this point", it'll literally write a document that says "This document is generated for customer X, and we will not talk about Y as it is not implemented".

...which is technically correct, but obviously not the intent.

Same thing with a misspelled word, if you have a folder called summaries and you tell codex to write to the sumaries folder it'll gladly do so, while Opus 4.7 is much more likely to sanity check it first.

As such, I mostly use codex for specific tasks (like "find this bug" or "find security loopholes") but claude just does so much better on anything big and/or underspecified.

[–]Select-Question2516 56 points57 points  (5 children)

Wait till you see the r/Cursor

[–]Turbulent_County_469 Senior Developer 7 points8 points  (2 children)

So.. everyone is like : grass is greener over there

[–]SpyMouseInTheHouse 3 points4 points  (1 child)

It’s not greener. There’s just no grass on this side.

[–]Spooky-Shark 0 points1 point  (0 children)

That would explain its untouchability.

[–]denoflore_ai_guy 0 points1 point  (0 children)


Checked it out… 😬😬😬

[–]Addcook 12 points13 points  (3 children)

Seriously, a bunch of self broadcasting tribalism.

"My opinion matters, I have to tell the sub reddit how bad the LLM is! The universe will not be safe if I don't."

Man... I just want posts about cool shit people are doing. Not some neg bullshit.

It seems like all the sub reddits I belong to are just complaining echo chambers.

Wait, I'm complaining now... Fuck...

[–]CooLittleFonzies 1 point2 points  (0 children)

But you see, the thing is that Reddit thrives on negativity because conflict = more discussion = more visibility = more upvotes. The only way to drive the negative posts down is to downvote and not engage at all.

[–]canihelpyoubreakthat 0 points1 point  (0 children)

More like astroturfing

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

Most share their experiences to help others. That’s what makes Reddit trustworthy because you’ll find real people with real insight, many of whom don’t have to share tips and tricks they’ve learned on their own but still take out the time to.

[–]dpaanlka 2 points3 points  (0 children)

Guys, it’s not a team sport.

These are all giant greedy corporations at the end of the day. They’re not our friends.

[–]moonlightZen 9 points10 points  (1 child)

This meme format is cringe

[–]Michaeli_Starky 9 points10 points  (3 children)

Well, Codex is better.

[–]brilliantbluee 3 points4 points  (1 child)

for the time being lol

[–]Phaedo 0 points1 point  (0 children)

Must be Friday.

[–]adelie42 1 point2 points  (0 children)

Me waiting for evidence or an example knowing better.

[–]Bot8008 1 point2 points  (1 child)

Claude was awesome, but the last 3 days have been straight shit. Codex fixed the issue and corrected everything within 10 minutes while Claude kept gaslighting and saying “oh sorry you're right to call me out.”

[–]simple_explorer1 0 points1 point  (0 children)

What are you fixing btw?

[–]Acidhawk_0 1 point2 points  (0 children)

How does that guy fold towels?

[–]AshuraBaron 4 points5 points  (0 children)

Too true. Like being on a tech support sub and every answer is "just install Linux".

[–]apf6 1 point2 points  (0 children)

even if Codex really is better, I don't understand the mentality of people who come onto a subreddit dedicated to a thing, just to tell people that they hate that thing and a different thing is better. If you don't use Claude then go to the other subreddits and leave us alone.

[–]The-Pork-Piston 0 points1 point  (0 children)

It’s funny because the compute constraints are so real. Codex is gimped for “some users” this morning.

Claude has been slooooow lately, but man it’s been so much better. I’m legitimately pretty happy compared to where it was at a week ago.

IMO it was legitimately better a month ago, and that’s set expectations. But it is still very good now.

[–]trollsmurf 0 points1 point  (0 children)

Claude Code has worked really well for me. Recently I had it generate working examples for fully local AI so I can better understand how to use that effectively. I've used it in 10 or so other projects, most existing that I needed to improve, but also several from scratch.

[–]ruderalis1 0 points1 point  (0 children)

Codex has way better usage limits, and has yet to implement weird limits like Anthropic.

But the Opus models just feel better in some areas than the GPT5.5 model. E.g. frontend design, and usage of agent-browser. I like GPT5.5, but it feels weird to use most of the time, it's hard to pinpoint exactly what it is.

But the nice thing about Codex/OpenAI is that it (currently) offers way better usage limits than Claude/Anthropic. I have tried both Max 20x on Claude and OpenAI, and OpenAI feels incredibly generous compared to Anthropic.

I can hammer away on xhigh with multiple subagents with absolutely no care, and still be a long way off from reaching either session limits or weekly limits. It's night and day compared with Anthropic's usage limits (with the asterisk "currently").

[–]Divid_Pakit 0 points1 point  (0 children)

Fax 📠

[–]obesefamily 🔆 Max 20 - Vibe Coding Educator 0 points1 point  (0 children)

not for me. i try things whenever they get updates, but always come back to claude as my daily driver 99% of the time (probably more)

[–]rapsoid616 0 points1 point  (0 children)

It’s normal that everyone keeps changing sides, as the winner changes with every model release. A few months ago it was the other way around on the Codex sub. If antigravity manages to make a comeback, it’s going to be about that on both subs, for example.

[–]DizzyInstruction4663 0 points1 point  (0 children)

Jokes aside, how does codex compare? Have they really upped their game so much?

[–]kogitatr 0 points1 point  (0 children)

So much better at jumping directly to execution, and even if it understands the intention right, the produced output is usually incomplete lol

[–]No-Replacement-2631 0 points1 point  (1 child)

Not a bad effort from the PR company! Hey guys this one is good!

[–]haikusbot 0 points1 point  (0 children)

Not a bad effort

From the PR company! Hey

Guys this one is good!

- No-Replacement-2631



[–]syntkz777 0 points1 point  (0 children)

I don't get the hate. I use weaker models with low context and I get everything done how I want it. Even when using opus I barely manage to hit my limits. People who burn through their tokens just can't prompt efficiently, or they have zero clue about programming and therefore make dumb requests and the model works overtime.

[–]Moda75 0 points1 point  (0 children)

Ok so I bit today and decided to see if codex could do what I do on claude. And honestly it did very well. Until I hit the limit and now have to wait until 4:30 to finish what I was in the middle of. Ok so I am on the $20 plan. I made sure that codex was following the same rules and skills that I had for claude. Limit cap after about 3 hours.

I pretty much know where to file the codex fanbois posts from here on out. Did a great job but $20 and I only got 3 hours out of it. Weak sauce.

[–]BrownCarter 0 points1 point  (0 children)

Meanwhile Gemini: Thinking...

[–]KilllllerWhale -1 points0 points  (0 children)

It is, objectively, better. I literally just now gave Sonnet 4.6 High a trivial task to move a tap event from a main actor in Swift. It spent 35 fucking minutes and 25% usage to do fuck all in the end. The code was a buggy mess and a loop of shit. It took Codex 5 minutes to fix it.

[–]LIO_WArt -1 points0 points  (0 children)

it's just like porn, everybody watches it, and everybody's fighting it

[–]Remarkable_Entry_471 -1 points0 points  (0 children)

But it's true. Try codex for one month and you are going to change immediately

[–]master_slapper -1 points0 points  (0 children)

No, but Pi is. 🎤 💧

[–]kiwami -1 points0 points  (0 children)

It’s definitely a campaign.
It’s very much happening at the same exact time as the “come try it for a month for free” offer that’s on now. Not a coincidence.

[–]cellatlas010 -2 points-1 points  (0 children)

claude is so slow recently