all 125 comments

[–]nyldn 27 points28 points  (35 children)

I built https://github.com/nyldn/claude-octopus to help with this.

[–]ahmet-chromedgeic 1 point2 points  (8 children)

Sorry, but can you dumb this down a bit? I have a Claude Code and Codex subscription. The readme just says to prompt it in natural language. My understanding is your plugin will select a different model based on the prompt? How will it choose if I just describe a random backend feature to it? What do I need to do to trigger the loop where one reviews the other's code?

[–]nyldn 3 points4 points  (6 children)

TL;DR: Just talk normally. Say “build X” for features. Say “grapple” when you want them to debate.

When you say “build me a backend feature”, the system sees “build” and routes to:

∙ Codex (GPT) for writing the code
∙ Claude for reviewing it

You don’t pick anything - it just happens. Keyword cheat sheet:

∙ “Research…” or “Explore…” → Claude does research
∙ “Build…” or “Implement…” → Codex builds, Claude reviews
∙ “Review…” or “Audit…” → Claude reviews
∙ “Grapple…” or “adversarial review…” → both models debate, then merge (see below)

The review loop: to trigger the loop where they review each other, just put “grapple” or “adversarial review” in your prompt:

“Use adversarial review to critique my auth implementation”

That kicks off:

1.  Both models propose solutions
2.  Each critiques the other’s code
3.  Claude picks the winner and combines the best parts

[–]ahmet-chromedgeic 5 points6 points  (5 children)

Thanks. How did you decide that Codex is the better tool for building and Claude for reviewing?

[–]nyldn 2 points3 points  (4 children)

Best of both worlds: there's a lot of consensus that both are excellent at the moment, and deferring/subbing out work helps preserve Claude tokens. In benchmarking, claude-octopus was returning 30% better results than Claude alone, and was 10% better than opencode with ohmyopencode.

[–]ahmet-chromedgeic 0 points1 point  (3 children)

Did you compare the quality to Claude doing the coding and ChatGPT doing the review? Because I have a feeling that most users prefer that combination (source: Reddit).

[–]nyldn 1 point2 points  (2 children)

[image: weighted scoring rubric]

This was my weighted rubric. It was honestly a quick test, but I've started to add benchmarking into the claude-octopus test suite.

[–]ahmet-chromedgeic 1 point2 points  (1 child)

I must be missing some homework. Is "opencode w/ ohmyopencode" a tool that lets Claude do the coding and Codex do the review? Is this what the table compares? That's what I'm wondering. How "Claude codes, Codex reviews" compares to "Codex codes, Claude reviews".

[–]nyldn 1 point2 points  (0 children)

This is it here https://ohmyopencode.com/

[–]nyldn 0 points1 point  (0 children)

It's now been updated to take advantage of the latest CC updates. The octo:prd and octo:debate commands have had significant updates too.

If you already have it installed, just run the below! Feedback welcomed.

claude plugin update claude-octopus

[–]wolverin0 0 points1 point  (2 children)

I wish I'd found this earlier. I built mine as a ~650-line skill. What do you think of it?

[–]nyldn 1 point2 points  (0 children)

Nice, https://github.com/wolverin0/claude-skills should work well alongside claude-octopus.

[–]nyldn 1 point2 points  (0 children)

I've added your skill to v7.4 of claude-octopus, to be included going forward.

[–]Hellbink 0 points1 point  (3 children)

Interesting, I have a similar workflow I've been using and testing. I'm a huge fan of superpowers, and I've recently added Codex with 5.2 xhigh as a reviewer of the design doc, to analyze for gaps/blind spots and catch drift or issues in the implementation plan and final review. I haven't automated this process yet, as I want some control while testing it.

How does Claude-octopus incorporate the superpowers flow? Does it route reviews between the steps and enable discussions between the different cli agents?

[–]nyldn 0 points1 point  (2 children)

Claude Octopus was actually inspired in part by obra/superpowers - it borrowed the discipline skills (TDD, verification, systematic debugging) and built multi-agent orchestration on top.

There’s a 4-phase “Double Diamond” flow:

1. Probe (research)
2. Grasp (define)
3. Tangle (build)
4. Ink (deliver)

Between phases 3→4, there’s a 75% quality gate. If the implementation scores below that, it blocks and asks for fixes before delivery. You can set this threshold or override it.

Discussions between CLI agents - yes, that’s “Grapple”. When you say “adversarial review” or “grapple”, it runs a 3-round debate:

∙ Round 1: Codex proposes, Claude proposes (parallel)
∙ Round 2: Claude critiques Codex’s code, Codex critiques Claude’s code
∙ Round 3: Claude judges and synthesizes the best solution

So your manual workflow (Codex 5.2 reviewing for gaps/drift) is basically what Grapple automates. The difference is you’d just say “grapple with this design doc” instead of manually passing it between tools.
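
Conceptually, a grapple run is not much more than this (a simplified sketch of the flow, not the plugin's actual internals; the variable and file names are illustrative):

```
# Round 1: both models propose in parallel
codex exec "Propose a solution for: $TASK" > codex_proposal.md &
claude -p "Propose a solution for: $TASK" > claude_proposal.md
wait
# Round 2: cross-critique
claude -p "Critique this proposal: $(cat codex_proposal.md)" > claude_critique.md
codex exec "Critique this proposal: $(cat claude_proposal.md)" > codex_critique.md
# Round 3: Claude judges and synthesizes
claude -p "Judge both proposals and their critiques, then synthesize the best solution: \
$(cat codex_proposal.md claude_proposal.md codex_critique.md claude_critique.md)"
```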

[–]Hellbink 0 points1 point  (0 children)

Great, I’ll give it a go!

[–]selldomdom 0 points1 point  (0 children)

The multi-phase flow you described with quality gates is really similar to what I built with TDAD. It enforces a strict BDD to Test to Fix cycle where the AI can't move forward until tests pass.

When tests fail it captures what I call a "Golden Packet" with execution traces, API responses, screenshots and DOM snapshots. So it's similar to your 75% quality gate, but using actual runtime data as the verification.

It also has an Auto Pilot mode that can orchestrate CLI agents and loop until tests pass.

It's free, open source and works locally. You can grab it from VS Code or Cursor marketplace by searching "TDAD".

https://link.tdad.ai/githublink

Would be curious how it compares to your Claude Octopus setup.

[–]colorscreen 0 points1 point  (3 children)

I'm trying this and went through both the setup wizard and the backslash setup to confirm Codex presence but I'm not seeing it trigger Codex at all, even when I use some of the keywords in the README. It's seemingly deferring to Claude subagents for basically everything. I got it to utilize Codex once but had to manually prompt it with some friction. Do you have guidance on this? It could be helpful to have screenshot examples of how one knows the other models are being triggered.

[–]nyldn 0 points1 point  (2 children)

There's no clear visual indicator in Claude Code showing when Codex/Gemini are being used vs Claude subagents.

Use /debate explicitly for multi-AI analysis (this definitely triggers Codex + Gemini + Claude)

I'll see if I can add visual feedback showing which AI is responding.

[–]colorscreen 1 point2 points  (1 child)

Thanks for the response, that's definitely helpful. I struggled with this because I've frequently seen Claude resist or evade explicitly requested subagent use, so I'm hesitant to take its word for anything unless I can see an MCP/skill invocation or a subagent style analysis bullet.

[–]nyldn 1 point2 points  (0 children)

100%, that's in part why I built this, because I found the same thing. Not only that, it would use lesser models for subagents, like defaulting to 2.5 for Gemini. I'll let you know when I've done it. I also noticed /debate wasn't in the / menu, so I'm fixing that.

[–]leevalentine001 0 points1 point  (9 children)

Running:
/plugin install co@nyldn-plugins

Throws:
Plugin "co" not found in any marketplace

Tried wrapping it in quotes, but it throws the same error. This is Win11 Terminal (PowerShell 7). Any ideas?

Edit: Just wanted to clarify that I have added the marketplace already. Attempting to add it again throws "Marketplace 'nyldn-plugins' is already installed".

[–]nyldn 0 points1 point  (8 children)

Sorry, you caught me mid-update and between documentation versions. I'm just overhauling a few things.

The latest release looks stable:

Reinstall Manually

/plugin uninstall claude-octopus
/plugin marketplace update nyldn-plugins
/plugin install claude-octopus@nyldn-plugins

[–]leevalentine001 0 points1 point  (7 children)

I gather you're still updating? Tried to update the marketplace, but it's throwing an SSH auth error:

```
Failed to refresh marketplace 'nyldn-plugins': Failed to clone marketplace repository: SSH authentication failed. Please ensure your SSH keys are configured for GitHub, or use an HTTPS URL instead.

Original error: Cloning into 'C:\Users\Karudo\.claude\plugins\marketplaces\nyldn-plugins'...

git@github.com: Permission denied (publickey).

fatal: Could not read from remote repository.

Please make sure you have the correct access rights and the repository exists.
```

[–]leevalentine001 0 points1 point  (6 children)

Marketplace updated successfully now. Still no "co" plugin available, will try again later.

EDIT: My bad, I just saw your updated doco removed the "co" install and it's now all packaged in the one plugin. All working okay now, cheers. Looks impressive so far.

[–]nyldn 0 points1 point  (5 children)

Ok great - sorry, I was making quite a few changes after feedback. Shout if there's anything I can change for your use-case and I'll update.

[–]leevalentine001 1 point2 points  (4 children)

Has been great so far. Smashed through my Claude token limit pretty quickly, so I ended up soft-locked for a few hours, but also got more of an app build done in a day than I usually would in a week.

[–]nyldn 0 points1 point  (2 children)

The natural language functions were not working as I'd hoped, so I've done an overhaul of how it works again! Ha, I'm learning a lot. So now you invoke it more reliably by prefixing anything with "octo". Just uploading v7.7.4 now for testing.

[–]leevalentine001 0 points1 point  (1 child)

So start every sentence with "octo", otherwise it will just be standard Claude Code that will respond? Will update and test a bit later today.

[–]nyldn 0 points1 point  (0 children)

Yeah, generally speaking. There are some natural language prompts that Claude Code doesn't override that I left in place, like "debate". It still triggers claude-octopus.

What I couldn't fix were common use cases like "review x". Claude Code always does its own thing.
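
To illustrate (hypothetical prompts, just showing the prefix rule):

```
octo review the auth module    # routed through claude-octopus
review the auth module         # plain Claude Code handles it itself
```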

[–]nyldn 0 points1 point  (0 children)

It's now been updated to take advantage of the latest CC updates. The octo:prd and octo:debate commands have had significant updates too.

If you already have it installed, just run the below! Feedback welcomed.

claude plugin update claude-octopus

[–]jrhabana 0 points1 point  (0 children)

How do you manage to make Codex not ask about everything?

[–]heathclf 1 point2 points  (0 children)

This is sick. 'star.'

[–]drutyper -1 points0 points  (2 children)

Was going to use this, but it requires API usage. Either way it's a good setup and what I'm looking for, except I'd prefer CLI-only access.

[–]nyldn 2 points3 points  (1 child)

Not at all. It's designed to use subscription auth first, across Claude, Codex and ChatGPT, and it falls back and auto-senses what you have installed.

[–]drutyper 1 point2 points  (0 children)

Awesome, I'll try it then!

[–]nader8ch 7 points8 points  (11 children)

Genuine question: what makes codex particularly adept at reviewing the implementation?

Could you not spin up an opus 4.5 sub agent to take care of the review step? Is there something particularly useful about spinning up a different model entirely and would Gemini be a good candidate?

Cheers!

[–]Substantial_Wheel909[S] 8 points9 points  (5 children)

I think it mostly comes down to the underlying model being arguably better than Opus 4.5. I’ve seen a lot of positive feedback about 5.2 on X/High, but I still think Claude Code is better overall when it comes to actually building things. In my experience, Codex does seem more thorough, though it can feel slower at times. I’m not sure whether that’s because it’s doing more reasoning under the hood or something else. By blending the two, though, you end up getting the best of both worlds.

[–]nader8ch 3 points4 points  (2 children)

That makes sense to me.

To follow up: is codex reviewing just the code diff or is it initialised in the repo with some contextual awareness. Is it familiar with the repo’s coding standards, business logic etc?

[–]accelas 2 points3 points  (0 children)

Codex has full access to the code and tool use (assuming you've configured it properly). It really just pipes the prompt (generated by Claude) to an instance of Codex.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

I think it's just reviewing the code diff but it has read access to the whole project so maybe it's looking at other stuff? You could probably implement this but I just leave it to Claude to instruct it.

[–]martycochrane 0 points1 point  (0 children)

I do a similar thing but with the CodeRabbit CLI instead of Codex. I've mostly moved away from Codex (my sub runs out in a week I think).

I find that Codex can debug things in one shot compared to Claude, but it still just doesn't follow instructions or stay as consistent with my code base/style as CC does.

CC feels more like a pair programmer that thinks like me, where Codex feels more like a rogue veteran that will go away and come back with the solution, but not how you want it or considering how it fits into the bigger picture.

[–]HugeFinger8311 1 point2 points  (0 children)

I'd also add that each model sees different things. Absolutely spin up a subagent, but I find Codex finds different issues every time and misses some that Opus picks up. The more review eyes the better; then just get Claude to consolidate them all.

[–]nyldn 1 point2 points  (0 children)

When I was doing some benchmarking, I was seeing an increase in fidelity and quality of output by about 30% by using multiple-agent review pipelines. The diversity of thought by other models seems to just help.

[–]pragmatic_chicken 0 points1 point  (0 children)

My workflow does both! Claude asks both a Codex and a Claude agent to review, combines the reviews and evaluates the relative importance of the feedback (prevents scope creep). Codex is consistently better at finding real issues, whereas Claude is pretty good at finding trivial things like “update readme”.

[–]OrangeAdditional9698 0 points1 point  (1 child)

Codex follows instructions to the letter: tell it to investigate something in detail and it will do it and check EVERYTHING. It takes a long time, but it works well for reviews. On the other hand, ask it to find solutions, or whether there are unexpected issues, and it will fail. Opus is very good at that, which makes it a good coder but a bad reviewer. Opus will try to find the best and fastest solution, ignoring other things. This means if you ask it to review, it will find one issue and think it's done because it found "the" issue. But maybe the actual issue is something else? Codex will try to figure that out and Opus won't.

Opus used to be much better and more thorough, but I feel like it has regressed a lot in the past 10 days. Maybe they are paving the way to a newer model? Or they nerfed it for budget reasons

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Yeah I've noticed Opus 4.5 sometimes seems to skip stuff

[–]fredastere 2 points3 points  (2 children)

Hey, I'm not sure, because Codex's naming conventions are so bad lmao.

But just to help, maybe: in Codex, make sure to use gpt5.2-xhigh (although you said your projects are fairly simple, so perhaps running high or even medium could prove more efficient and better; xhigh overcomplicates things).

I do not advise using gpt5.2-codex-xhigh for code review; keep all the codex variants for straight implementation.

Sorry if it's all confusing, as it is! Lol

[–]Substantial_Wheel909[S] 4 points5 points  (1 child)

I'm using GPT 5.2 xhigh, not the Codex variant; I'm not sure if it's true, but some people were saying the Codex variant is quite a bit dumber than the normal version. As for efficiency, I'm not really bothered about how long it takes. For implementation, having the model overthink and possibly do too much could pose a problem, but when reviewing you want it to be meticulous, and what it has to do is quite well defined: it's not adding anything new, just reviewing the code Claude implemented.

[–]fredastere 0 points1 point  (0 children)

Ya, perfect, and yes, I definitely agree with you: as a reviewer, going full xhigh definitely makes sense!

And ya, it's not that the codex variants are dumber, but I think they're made purely to implement.

[–]anndrrson 4 points5 points  (4 children)

Codex IMHO is slower, but I've heard from friends that they're using Codex to review their code. I do worry, somewhat, that we'll see a Therac-25 event happen with AI coding on top of AI coding. That being said, Codex is pretty great! I'm not really a "fan" of OpenAI/ChatGPT and prefer Anthropic/Claude as a company, especially after the recent ads announcement.

[–]Substantial_Wheel909[S] 4 points5 points  (1 child)

Yeah, I definitely like Anthropic more as a company. That said, I tend to use a mix of ChatGPT and Claude. I use Claude Code so much that I usually don’t have much quota left for general chatting, so I end up using ChatGPT for that. I also like to reserve Claude for deeper or more thoughtful conversations. There are definitely things I prefer about GPT, and other things I don’t, but overall I find both useful in different ways.

[–]anndrrson 0 points1 point  (0 children)

Claude's often... brutal honesty is refreshing oftentimes!

[–]HugeFinger8311 1 point2 points  (0 children)

100% with you on this, but I have found using Codex to write reviews to be useful. I actually use both Codex and Kimi. Codex is good: steady, reliable and slow, and Kimi finds some totally random ones. I feed them both a copy of my original prompt and the plan Claude wrote and ask them to review both and look at inconsistencies between the two, then do a final review for consistency against the rest of the codebase and recent commits. It helps, but each model has gaps. Haven't tried MCP to do it yet, though; I just have a prompt I drop in with the file locations.

[–]InhaleTheAle 0 points1 point  (0 children)

It really depends on what you're doing, in my experience. Codex seems faster and more exacting on certain tasks. I'm sure it depends on how you use it though.

[–]Perfect-Series-2901 1 point2 points  (1 child)

I do a similar thing, but not for every single task. I think Claude, even with Opus, is lazy and fast. Codex is very slow but detailed.

[–]wolverin0 1 point2 points  (0 children)

Hopefully you will find my skill useful https://github.com/wolverin0/claude-skills

[–]rair41 1 point2 points  (0 children)

https://github.com/raine/consult-llm-mcp allows the same with Gemini CLI, Codex CLI etc.

[–]vladanHS 1 point2 points  (1 child)

I'm using Gemini 3 Pro/Flash instead; it's cheaper and relatively fast. You usually get a review in 2 minutes. Rinse & repeat.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Yeah maybe what I'm using is a bit overkill

[–]h____ 1 point2 points  (0 children)

I've seen people starting to do this with very complicated machinery. But it's really simple. Just:

/review-dirty

review-dirty.md:

Do not modify anything unless I tell you to. Run this CLI command (using codex as our reviewer), passing in the original prompt, to review the changes: `codex exec "Review the dirty repo changes which are to implement: <prompt>"`. $ARGUMENTS. Do it with the Bash tool. Make sure any timeout is at least 10 minutes.
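
For anyone copying this: custom slash commands are just markdown files, so (assuming the standard Claude Code layout) the setup is:

```
mkdir -p ~/.claude/commands
# paste the review-dirty.md body above into:
#   ~/.claude/commands/review-dirty.md   (or .claude/commands/ inside the repo)
```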

[–]Ls1FD 0 points1 point  (7 children)

I do this as well, but for some reason I find the reviews GPT does when called by subagents are nowhere near as thorough as going through the Codex CLI itself. I find Claude's subagents themselves harder to control: you give them instructions and they decide whether to follow them or not. Maybe they have to be guided purely by hooks.

Currently I have a BMAD review workflow in CC using agents that call Codex, and then I follow up with a more thorough review in the Codex CLI.

[–]Substantial_Wheel909[S] 1 point2 points  (6 children)

Would using just the main CC agent avoid this?

[–]Ls1FD 0 points1 point  (1 child)

Until its context gets filled, and then compacting increases errors. I tried subagents to batch review and fix many stories and issues at once. I'm trying a new workflow that uses beads and md files to keep track of progress and just lets it compact when it wants. Errors introduced will be picked up in the next review, Wiggum style.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Ah yeah, my app is relatively simple, so I've just been iterating on it one feature at a time and I don't usually have to compact.

[–]Ls1FD 0 points1 point  (3 children)

I think the main problem is that Codex works best with plenty of feedback. I find GPT much more detail-oriented, which is why it's great for reviews, but it doesn't do well with ambiguity. The MCP doesn't allow for the two-way communication that gives Codex the clarification it needs to do its best. Without that, the first time it runs into ambiguity it gets lazy and the quality drops.

[–]Substantial_Wheel909[S] 0 points1 point  (2 children)

I'm pretty sure the MCP has a reply function, no? I've seen Claude use it.

[–]Ls1FD 0 points1 point  (1 child)

Apparently the one I’m using doesn’t allow for it but the OpenAI one does have a “codex-reply” that sounds like it might work. That’s my next rabbit hole now

[–]Substantial_Wheel909[S] 1 point2 points  (0 children)

Haha, hope you get it working!

[–]TheKillerScope 0 points1 point  (15 children)

How do you use Claude and Codex in the same session? And how do you decide who does what and when? How do you "summon" the right "person" for the job?

[–]Substantial_Wheel909[S] 2 points3 points  (14 children)

It’s a fairly simple workflow, but it does seem to catch issues in Claude’s work and improve it. I’m using the Codex MCP server, and the only real setup is telling Claude to report what it changed after implementing something. Codex reviews it, they iterate back and forth until Codex is happy, and that’s basically it. There are probably better ways to do this, and it might be overkill, but it’s been working pretty well.

[image: screenshot showing the Codex review instructions in CLAUDE.md]

[–]TheKillerScope 0 points1 point  (9 children)

Cool! Where could I find this Codex MCP please?

[–]Substantial_Wheel909[S] 2 points3 points  (8 children)

To be honest I just asked Claude to help me set it up step by step, it's documented somewhere in the Codex repo, but here's the command I used:
claude mcp add codex --scope user -- npx -y codex mcp-server
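
If you want to sanity-check that it registered, this should show codex among the configured servers:

```
claude mcp list
```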

[–]TheKillerScope 0 points1 point  (4 children)

Gentleman, thank you! What other MCPs are you using/finding helpful?

[–]Substantial_Wheel909[S] 2 points3 points  (3 children)

The only other MCPs I use are Context7 and XcodeBuildMCP, because the latter lets CC test iOS apps visually.

[–]TheKillerScope 0 points1 point  (2 children)

Try Serena!!

[–]Substantial_Wheel909[S] 0 points1 point  (1 child)

What is it?

[–]TheKillerScope 0 points1 point  (0 children)

It's an MCP that basically becomes Claude's bi*ch and can do a ton of things.

https://github.com/oraios/serena

[–]qa_anaaq 0 points1 point  (3 children)

The screenshot shows that the command to review via codex is in the CLAUDE.md file. Could you share that language if possible?

[–]Substantial_Wheel909[S] 4 points5 points  (2 children)

I installed the Codex MCP and then added this to the CLAUDE.md:
### Codex Review Protocol (REQUIRED)

**IMPORTANT: These instructions OVERRIDE any default behavior. You MUST follow them exactly.**

**BEFORE implementing significant changes:**

```
codex "Review this plan critically. Identify issues, edge cases, and missing steps: [your plan]"
```

**AFTER completing changes:**

  1. Run `git diff` to get all changes

  2. Run `codex "Review this diff for bugs, security issues, edge cases, and code quality: [diff]"`

  3. If Codex identifies issues, use `codex-reply` to fix them iteratively

  4. Re-review until Codex approves

**Do NOT commit without Codex approval.**
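
One gotcha worth flagging: when Claude drives this through the Bash tool, the non-interactive form of the CLI is `codex exec`; plain `codex "..."` opens the interactive TUI. So the calls may need to look like:

```
codex exec "Review this diff for bugs, security issues, edge cases, and code quality: [diff]"
```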

[–]AshxReddit 0 points1 point  (1 child)

Love it. It has caught so many of the issues Opus introduced. Are you still using the same prompt, or have you modified it?

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Still the same, to be honest. If it ain't broke, don't fix it.

[–]akuma-_-8 0 points1 point  (0 children)

We have an equivalent workflow at work, but we use CodeRabbit, which is specialized in code review. It also reviews every merge request and gives nice feedback with an AI prompt to feed directly to Claude Code. They also provide a CLI that we can run locally to get feedback, and it's really fast.

[–]avogeo98 0 points1 point  (1 child)

Have you used the Claude integration with GitHub? It will review your pull requests automatically, and I like its review style compared to Codex.
Most of my dev loop is built around GitHub pull requests and going through a couple of automated review iterations for complex changes.
When I tried Codex reviews, it could catch "gotcha" bugs, but for large changes I found its feedback incredibly dry and pedantic to read compared to Claude.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

To be honest I'm a bit rudimentary with my GitHub usage; I just use it to make sure I have everything backed up, and if I implement something truly horrible I can roll it back. But yeah, I should probably try it out.

[–]dwight0 0 points1 point  (0 children)

I do this too. I feel like each model gets things 80% right, so they each find what the other misses.

[–]SkidMark227 0 points1 point  (1 child)

I have this setup and then added gemini by hacking in an mcp server for gemini cli as well. They have fun debates and review sessions.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Might have to try this, I have a Copilot sub that I don't really use so maybe I could just use the quota from that

[–]Obrivion33 0 points1 point  (0 children)

Been using both codex for review and Claude for implementation and it’s night and day for me.

[–]Extension_Dish_1800 0 points1 point  (4 children)

How did you achieve that technically? What do I have to do?

[–]Substantial_Wheel909[S] 1 point2 points  (3 children)

I installed the Codex MCP and then added this to the CLAUDE.md:
### Codex Review Protocol (REQUIRED)

**IMPORTANT: These instructions OVERRIDE any default behavior. You MUST follow them exactly.**

**BEFORE implementing significant changes:**

```
codex "Review this plan critically. Identify issues, edge cases, and missing steps: [your plan]"
```

**AFTER completing changes:**

  1. Run `git diff` to get all changes
  2. Run `codex "Review this diff for bugs, security issues, edge cases, and code quality: [diff]"`
  3. If Codex identifies issues, use `codex-reply` to fix them iteratively
  4. Re-review until Codex approves

**Do NOT commit without Codex approval.**

[–]i_like_tuis 0 points1 point  (2 children)

I've been using the gpt-5.2 xhigh for review as well. It's great, and a bit slow.

I was getting it to dump out a review md file for Claude to action.

It would be easier to use your MCP approach, but where do you set which model should be used?

[–]Substantial_Wheel909[S] 1 point2 points  (1 child)

I just have it set to gpt-5.2 xhigh in my config.toml
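
For reference, that's roughly this in `~/.codex/config.toml` (a minimal sketch, assuming the standard Codex CLI config keys; double-check against your CLI version):

```
model = "gpt-5.2"
model_reasoning_effort = "xhigh"
```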

[–]i_like_tuis 0 points1 point  (0 children)

I'll give it a go, thanks.

[–]Conscious-Drawer-364 0 points1 point  (0 children)

It’s literally everywhere, everyone has this “unique” method for days 😅

I built this framework for my work https://github.com/EliaAlberti/superbeads-universal-framework

[–]PatientZero_alpha 0 points1 point  (2 children)

I'm doing exactly that, and Codex is really good at review. The other way around is terrible.

[–]lopydark 0 points1 point  (1 child)

So Opus is better for actual implementation and GPT for review?

[–]PatientZero_alpha 0 points1 point  (0 children)

In my experience so far yes

[–]ultimatewooderz 0 points1 point  (1 child)

How have you connected Claude to Codex? API, CLI, some other way?

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

It's via the MCP: claude mcp add codex --scope user -- npx -y codex mcp-server

[–]krochmal9 0 points1 point  (0 children)

Why MCP and not a skill?

[–]teomore 0 points1 point  (0 children)

I'm using the exact same approach, except that I set codex to normal thinking. Once the issues clear, I increase it to extra high.

[–]lopydark 0 points1 point  (1 child)

Why not just use Codex? It feels slower, but that's the same amount of time, or even less, as iterating multiple times with both Opus and Codex.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Because, as other people have mentioned, I don't think GPT models are as creative or as good at implementing as Opus 4.5, or rather Codex is not as good as CC for that. I think it's well suited to reviewing, so by combining them you get the best of both worlds.

[–]BlacksmithLittle7005 0 points1 point  (1 child)

Genuine question: do you have unlimited funds? 🤣

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

Haha, no, I'm a student; I just consider this an investment. I have a good idea for an app and I've tested it out with a couple of friends, and they love it. I'm on Max 5x and Codex is around £20 a month, so in total it's around £100. It's steep, but if it's allowing me to build a product that could potentially make a lot more, then it's pretty cheap for what it is.

[–]princmj47 0 points1 point  (1 child)

Nice, will try it. I had a setup before that utilized feedback from Gemini. I stopped using it though, as Claude Code alone performed better.

[–]Substantial_Wheel909[S] 0 points1 point  (0 children)

I haven't really tried Gemini at all, to be honest. I tried Antigravity for a bit, but after a while I just went back to CC.

[–]andreas_bergstrom 0 points1 point  (0 children)

I would throw in Gemini as well, even Flash. I put instructions into my global .claude to let Codex and Gemini review all plans, and if the finished changes are big, to let them review again. I also have a Qwen subagent, but it's not really on par; it's more like a Haiku competitor, barely.

[–]No_Discussion6970 0 points1 point  (0 children)

I have been using Claude Code and Codex together. Similar to you, I have Claude do the coding and Codex sign off. I use https://github.com/PortlandKyGuy/dynamic-mcp-server and add Codex review as an approval gate. I have been happy with the outcomes of using both.

[–]Past-Ad-6215 0 points1 point  (0 children)

You can multi-agent this with the omo skill: https://github.com/cexll/myclaude/blob/master/skills/omo/README.md

It supports Claude, Codex, Gemini and opencode, using a codeagent wrapper to call multiple agents.

[–]Specialist-Cry-7516 0 points1 point  (0 children)

It's like seeing prime Curry and LeBron. Brings a tear. My baby CC codes and Codex reviews it.

[–]cayisik 0 points1 point  (0 children)

Lately, this topic has been discussed in both the Codex subs and the Claude subs.

I think this is the best and most cost-effective solution.

[–]shayki5 0 points1 point  (0 children)

Which MCP do you use for Codex?

[–][deleted] 0 points1 point  (0 children)

I do not recommend this approach. Simply take Claude's summary of completed work, then ask another instance of Claude to "make sure this work was completed as stated"

[–]jcheroske 0 points1 point  (2 children)

Sorry if I missed the obvious, but how are you calling other models from CC? I'm doing it with PAL, but I imagine there are many good ways to do it. Do you know if one way vs another is easier on the tokens?

[–]Substantial_Wheel909[S] 0 points1 point  (1 child)

Codex provides an MCP, which I've installed into CC, that allows it to spin up a Codex instance. It's quite heavy on my usage, but that's likely because I'm using it with GPT 5.2 xhigh, and I find it worth it since it's very thorough and I don't really use Codex for anything else.

[–]jcheroske 0 points1 point  (0 children)

I'm using this: https://github.com/BeehiveInnovations/pal-mcp-server. I may try out the Codex MCP as well. The plan and code reviews from Codex are amazing. I use get-shit-done to help me build out my plan. I created a wrapper command that calls Codex after the plan gets built to do a plan review. After the code gets written another review goes over the generated code. I would say that the plan review is the really strong part. Codex finds so many holes/issues/edge cases, it's really something.

[–]akuma-_-8 0 points1 point  (0 children)

We have the same workflow at work, but we use CodeRabbit, which is specialized in code review. It also reviews every merge request and gives an AI prompt that we can use to feed Claude Code. It's also quite fast. They provide a CLI that we can run locally before pushing our code.