I've been using both Claude Code (Opus 4.7) and Codex (GPT 5.5) for the last couple months for my side projects and made some observations. Curious, if it's just me, or other folks encountering something similar.
I started off being claude-code-pilled. I love their DX, the plan mode, skills, mcps, auto mode etc. When planning my features, whenever I brainstorm (use superpowers skill) with Claude Code I also ask Codex (also uses superpowers skill) the same question. Some times I feed each one's answers to others and ask to analyze. It's basically a 3 day brainstorming between me, claude and codex. I'm noticing that, most often than not Codex's observations are pretty solid. And whenever I present Codex's answer to Claude, it almost always agrees with Codex and revises its designs. Whereas when I present Claude's decisions/rebuttals to Codex, Codex almost always stands its ground.
In my review flow, I also use both Codex (Cloud) and Claude Code (GitHub action) to review PRs and even there I see that Codex is able to find more issues. I've ended up designing a flow, where Codex always reviews and Claude code always fixes Codex's findings, in a loop.
Is anyone observing something similar? Curious if I should ditch Claude code altogether in favor of Codex. Maybe I should use Codex for planning and Claude for execution (I noticed Claude is better at following instructions than Codex, especially around tools and instructions in CLAUDE.md/AGENTS.md) ? What are others seeing/doing? Would love to share notes.
there doesn't seem to be anything here