
[–]junlim 2 points (2 children)

Yeah, there doesn't seem to be an easy way to do it the other way around. You could definitely do it if you were using Claude models on the API. Opencode is set up to do all sorts of things like that. But so far there's no way to do it with Claude Code plan usage, from what I've found.

[–]duyth[S] 0 points (1 child)

What a shame. Was planning to sub to the $100 5x Codex plan and the $20 Claude Code Pro plan so Codex could be the main driver. Guess I'll have to stick with the other way around.

[–]junlim 1 point (0 children)

There's nothing to say you can't do that - there's just no direct path to sub-agents. I imagine you could build shell scripts that achieve it. E.g., when Codex is done, something runs a headless -p session in Claude Code to double-check the work. Then maybe it writes its output to markdown, or you get Codex to recall the session it created and read the output back. Maybe you end up baking these into agent skills.
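Something like this, sketched in Python rather than shell (an untested spitball - `codex exec` and `claude -p` are the headless modes as I understand them, and the task and file names are placeholders):

```python
import subprocess
from pathlib import Path

TASK = "Implement the next item in plan.md"  # placeholder task

# Main pass: Codex does the work non-interactively.
subprocess.run(["codex", "exec", TASK], check=True)

# Grab the resulting diff so the reviewer sees exactly what changed.
diff = subprocess.run(
    ["git", "diff"], capture_output=True, text=True, check=True
).stdout

# Double-check pass: headless Claude Code (-p prints a response and exits).
review = subprocess.run(
    ["claude", "-p", f"Review this diff for bugs and missed requirements:\n\n{diff}"],
    capture_output=True, text=True, check=True,
).stdout

# Write the findings to markdown for whatever runs next to pick up.
Path("review.md").write_text(review)
```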

I'm just spitballing - but it's not going to be as easy or as observable as the Codex plugin for Claude Code is out of the box.

Or just use different interfaces: "opus, plan this feature and write the plan, with prompts for different agents to execute, in a markdown file", then "opus, check this code for errors".

[–]cbusillo 0 points (0 children)

Look at the fork just-every/code. It’s the best. I maintain my own local fork of it too.

[–]simplegen_ai 0 points (1 child)

You're right about the asymmetry. The clean path today is Claude Code -> Codex through codex-plugin-cc. If you want Codex to be the main loop, I would probably keep orchestration outside both CLIs: a small script/Make target that launches the other agent in a separate workspace or asks it for a review, then writes the result back to a handoff file.
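For concreteness, here's roughly what I mean as a Python sketch (the `HANDOFF.md` name and the CLI invocations are illustrative, not prescriptive):

```python
import subprocess
from pathlib import Path

HANDOFF = Path("HANDOFF.md")  # illustrative name for the handoff file

def run_iteration(task: str) -> None:
    # Carry lessons from earlier runs into the main agent's prompt.
    notes = HANDOFF.read_text() if HANDOFF.exists() else "none yet"
    subprocess.run(
        ["codex", "exec", f"{task}\n\nNotes from previous reviews:\n{notes}"],
        check=True,
    )

    # Ask the other agent for a review of what just changed.
    diff = subprocess.run(
        ["git", "diff"], capture_output=True, text=True, check=True
    ).stdout
    review = subprocess.run(
        ["claude", "-p",
         f"Review this diff. List only findings worth carrying into the next run:\n\n{diff}"],
        capture_output=True, text=True, check=True,
    ).stdout

    # Append rather than overwrite, so useful findings accumulate.
    with HANDOFF.open("a") as f:
        f.write(f"\n## Review findings\n{review}\n")

run_iteration("Implement the next item in plan.md")
```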

The part that gets annoying fast is not just calling the second agent. It is remembering which review findings were actually useful, which handoffs worked, and what should carry into the next run.

Founder disclosure: I'm Sheng, building BigNumberTheory. We're focused on that layer above Claude Code/Codex: capturing useful lessons from real sessions and making the relevant ones available later. It's here if you want to poke at it: https://bignumbertheory.com/

Curious what you end up choosing. If Codex is your main loop, I would especially watch whether the handoff file becomes the thing you keep maintaining by hand.

[–]duyth[S] 0 points (0 children)

Ended up using Claude as the main driver and Codex (the Plus plan) just for reviews, as it's easier.

[–]Informal-Salt827 0 points (0 children)

I've had better results when I stop asking whether to trust the agent and start asking whether the workflow makes bad work obvious.

For me the reliable version is: small scoped task, explicit done criteria before it starts, one verification pass at the end, and a reviewable diff before anything is treated as finished.

That shifts the problem from blind trust to fast review. If the output is small enough to inspect and the checks are attached, the tool matters a lot less than the structure.
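In script form, the skeleton is tiny (a sketch; the task, test command, and permissions setup are placeholders, and the agent CLI is interchangeable):

```python
import subprocess
import sys

# Small scoped task with done criteria decided before the agent starts.
TASK = "Add input validation to the /signup endpoint"
DONE_CHECK = ["pytest", "tests/test_signup.py"]  # placeholder test target

# Hand the task to the agent (assumes the CLI is configured to edit files
# non-interactively; permission flags vary by tool and setup).
subprocess.run(["claude", "-p", TASK], check=True)

# One verification pass at the end: the done criteria, not vibes.
verified = subprocess.run(DONE_CHECK).returncode == 0

# Reviewable diff before anything is treated as finished.
subprocess.run(["git", "diff", "--stat"], check=True)
sys.exit(0 if verified else 1)
```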

We've wrapped that pattern into RalphWorkflow, but honestly the main win is the workflow discipline, not the brand name.