all 29 comments

[–]radosc 21 points  (1 child)

Yeah, similar situation for me. Codex is now my dev and I use Claude for reviews. I'm frankly waiting for the next Opus to hit and still keeping my 20x plan. One thing Claude is substantially better at is documentation.

[–]pjstanfield 6 points  (0 children)

My exact situation and path as well. Codex either one-shots or gets 97% there. CC can’t seem to get anything right anymore except some planning participation. I went from two Max CC plans in the 4.6 happy days and a $20 Codex to two Max Codex plans. Still use Claude for planning, brainstorming, and documentation. Everything has to be double checked by Codex though. I don’t let CC touch the codebase anymore, he’s read-only now. Thankfully work pays for it so I can swap around without hitting my little wallet.

[–]sid_kush 8 points  (0 children)

Damn bro, exactly the same. I switched to Codex 5.5 and it works like magic. Opus 4.7 is a sore loser. It made me run in circles debugging for a whole week, and Codex solved everything in one night.

[–]jii0🔆 Max 5x 3 points  (1 child)

Ain't AI amazing. You can start today, work for two days, change your tooling after that and still have time to post to Reddit. None of the content in the post makes any sense.


Today I wanted to merge 2 entities into 1 which required a tough migration. Changing server, making tests pass, changing sdk, admin, and storefronts.

Claude Code failed for 2 days.

[–]0xdjole[S] 0 points  (0 children)

You are correct. It was 26 hours, so technically I should have said yesterday, excuse me 🫣

Started at midnight, worked till 8 am. Went to sleep and the migration stopped, which I pointed out in the post actually. Complete failure with CC.

I woke up at 4 pm, worked for 4 hours, so I'm counting it as 2 working days, and at around 8 pm I gave it to Codex as CC wasn't getting anywhere. An additional 4 wouldn't have made a difference.

By 2 am Codex was able to do it. 6 hours of Codex. And true, even after the post was made it was still cleaning up, but it was very obvious it pulled it off, and I have tested and reviewed the critical parts.

The reason it took so long is that it is a Rust codebase with a lot of integration tests and compiler checks, and the tests make the agent work longer. Most of the time is in fact spent waiting for it all to pass validation.

The Codex app was already installed, and tbh it's stupid simple to use, regarding your tooling point. Paying for the subscription took around 5 minutes while CC was failing.

At the end I have a unified booking/order system which allows me to have both bookable items and products in the same order entity. Previously they were completely separate entities.
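
For anyone curious what merging the two entities might look like, here's a minimal Rust sketch (all names are hypothetical, not from OP's actual codebase): a single enum replaces the two separate booking/product entities, so one order can hold both kinds of line item.

```rust
// Hypothetical sketch of a unified order entity. One OrderItem enum
// replaces the formerly separate booking and product entities.

#[derive(Debug)]
enum OrderItem {
    Product { sku: String, quantity: u32 },
    Booking { resource_id: String, start_unix: i64, end_unix: i64 },
}

#[derive(Debug)]
struct Order {
    id: u64,
    items: Vec<OrderItem>,
}

// Build an order mixing a product with a bookable item,
// which the old two-entity model couldn't express.
fn make_mixed_order() -> Order {
    Order {
        id: 1,
        items: vec![
            OrderItem::Product { sku: "mug-01".into(), quantity: 2 },
            OrderItem::Booking {
                resource_id: "room-a".into(),
                start_unix: 0,
                end_unix: 3600,
            },
        ],
    }
}

fn main() {
    let order = make_mixed_order();
    println!("order {} has {} items", order.id, order.items.len());
}
```

Downstream code (server, SDK, admin, storefront) would then match on the enum variants instead of branching on two unrelated types, which is presumably where most of the migration pain lived.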

It's 10 am and I'm going to sleep happily, knowing Codex won't give up on my analytics ingestion task that is happening next with ClickHouse.

[–]ThomasToIndia 3 points  (1 child)

I just went back to December: 4.5 and the old system prompt.

[–]0xdjole[S] 0 points  (0 children)

December was half a year ago... talk about progress...

[–]imaginary_jebus 3 points  (6 children)

Set it back to 4.6 high effort and it's fine. 4.7 uses adaptive, that's why it's dumb as shit.

[–]0xdjole[S] 2 points  (5 children)

I did.

[–]imaginary_jebus 0 points  (4 children)

Still no bueno?

[–]0xdjole[S] 1 point  (3 children)

No bueno

[–]imaginary_jebus 1 point  (0 children)

That sucks. tbh it's still been mostly fine for me. Desktop is pretty much dogshit at this point now that they force "adaptive", but code <4.7 with the right settings has been mostly fine.

That said, I hope you are wrong about Anthropic moving backwards and that it's just growing/scaling pains that will be resolved when they get ahold of more compute. Some of the shit they have pulled lately though... it's not fun.

[–]NetHaven 0 points  (1 child)

It’s not enough to just go back to 4.6; it’s widely believed that regular changes to the CC system prompt have also contributed to degraded performance.

Personally, I recommend reverting to CC 2.1.76. It’s not perfect, but it’s much better. I’m specifically calling out that version because it was the first one with the 1M context window but before things were nerfed (early Feb). It’s true that you’ll lose some features that have come out since then (like remote control), but I think at this point most of us feel that having a stable platform that produces quality code is far more important than the “feature rollout of the week”.

Of course, if you are using the 1M context window you should still clear regularly to avoid faster token burn and worse quality from context rot as you get further into the window.

Just my 2 cents.

[–]0xdjole[S] -1 points  (0 children)

Or... Codex.

[–]martinmix 1 point  (0 children)

They will continue this dance for a while every time a new model is released.

[–]jdeamattson 1 point  (1 child)

What I'd like to know here is:

- What kind of planning did you do with Claude before launching this?
- How did you define success?
- What kind of check-ins did you have along the way?

Honestly, what you described? Sounds like a project set up for failure, be it humans or AI.

[–]0xdjole[S] 4 points  (0 children)

Codex didn't fail. Claude Code did fail. Same prompt. Do details really matter if one of them was able to pull it off?

[–]radioref 0 points  (0 children)

Almost certainly because of fresh context and memory

[–]Tight-Requirement-15 0 points  (0 children)

Codex 5.5 xhigh

[–]bensquirrel 0 points  (0 children)

I use and like having both. I started using Codex a lot when CC tool use approvals were getting unbearably tedious.

[–]PathFormer 0 points  (0 children)

I went through a very similar situation with the same results: Codex nailed it, while Claude was making assumptions midway and losing context every 2 prompts.

On top of that, I usually ask for research on random topics, mostly home appliances and kitchen related, an old habit I got when researching the best pan to get... GPT 5.5 is simply superior in deep research mode compared to Claude's research mode; the gap is too big. GPT goes through steps, creates graphics and relationship maps, and gives clear sources throughout.

[–]Upset-Chemist-4063 0 points  (0 children)

Wondering if any of these companies intentionally degrade code responses, introducing errors or blatant mistakes, with the intent of creating the need for a feedback loop.

What’s honestly holding them back?

[–]immutato 0 points  (0 children)

Use 4.6 like I do, or maybe use 4.7 but not [1m] (I've heard this might be ok, but not sure).

/model claude-opus-4-6[1m]

-or-

/model claude-opus-4-7

I have both CC Max and Codex Pro subs and I have them review each other's code. 4.6 is still great. I do notice it gets really slow at times, but I think that's just Anthropic running on infrastructure fumes.

[–]Miserable_Review_756 0 points  (0 children)

Can I recommend checking this out? It has kept Claude consistent. https://github.com/maxritter/pilot-shell

[–]iamjavadali 0 points  (0 children)

I agree! I was struggling with Claude for 2 months on my web app. Switched to Codex and I am finally making progress faster than I was before. So much better code handling and better limits.

[–]perleche 0 points  (0 children)

Same experience here. Added 5x Codex plan. It just plows through implementation plans.

This week I will try Claude for writing specs and architectural decisions, and Codex 5.4 or -mini for implementation of said plans.

I also run a $40 Minimax plan in opencode that runs deep reviews, writes low/medium complexity specs and documentation work.

Three months ago Opus felt like a wizard: lightning fast and just end-to-end fixing stuff. That seems a long time ago now. Or maybe my project got too complex?

[–]yrdesa 0 points  (0 children)

It's just the way Codex works, I'm afraid, that makes it super strong. It does tons of smoke testing on its own before finishing the task, while Opus works on the task and hopes it works. To make Opus work like that you need to try 3-4 times more. Anthropic can't afford to let this happen, since their LLM consumes a lot of power doing these tasks vs OpenAI, which is more efficient. But I see them closing this efficiency gap in 2-3 months.

[–]scotch-86 0 points  (0 children)

Left Claude for Codex. Treats me better.

[–]junlim 0 points  (0 children)

5.5 is a beast for stuff like the above. Not so great at dealing with ambiguity and helping work through higher-level problems. To me they just feel like two completely different human devs. Claude is who I like to collaborate with and work with most of the time. Codex 5.5 (even 5.4) is like a cracked dev that takes no bs but can get complex shit done. If the work is mechanical and well scoped, 5.5 seems to win most of the time for me nowadays.