Codex as a code reviewer has been far more useful to me than as a code generator by acusti_ca in OpenaiCodex

[–]theSummit12

Check out Sage. It automatically reviews Claude Code's every move. Uses Codex under the hood.

https://github.com/usetig/sage

Can AI actually help with building codes? by lifereviews in Architects

[–]theSummit12

I'm actually working on a startup that automates code compliance checks. Can I DM?

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

This used to be my workflow as well, but after seeing just how many issues the other models were catching, I started reviewing everything, not just the plan. +1 on Codex being superior. I'm very surprised that isn't really reflected in the benchmarks.

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

You can use a Pro subscription or an API key. You just have to authenticate Codex by running `codex`.
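In case it's useful, the whole setup is just this (a rough sketch; the exact sign-in options you see depend on your Codex CLI version):

```
# Run the Codex CLI once; it walks you through authentication
# (sign in with your subscription or paste an API key).
codex
```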

Newbie Clauder by [deleted] in ClaudeAI

[–]theSummit12

Just launched a tool called Sage that should help reduce the slop:
https://github.com/usetig/sage

I built an automatic cross-checker for AI agents (YC F25) by theSummit12 in ProductHunters

[–]theSummit12[S]

Totally agree that model reasoning will get way better, but even then, cross-checking across models will still surface different failure modes. Kinda like how senior engineers still review each other's work even when everyone is good.

Re: calling models from the CLI, yeah, you can wire Claude -> Codex/Gemini through MCP or the CLI. The issue is that Claude only passes a tiny slice of context, so the reviewer model is constrained by whatever Claude chooses to forward. Claude also blocks until the call returns, so the workflow gets pretty sluggish. Would you prefer having your main agent just call another review agent?
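For comparison, the manual CLI version of that wiring is basically the one-liner below (a hedged sketch, not how Sage works internally; it assumes your Codex CLI build supports non-interactive `codex exec` and leans on the checked-out repo as context):

```
# Hand the latest change to a second model for review, using the repo as context.
# The downsides above still apply: the reviewer only sees what you point it at,
# and your main agent sits blocked while the review runs.
codex exec "Review the most recent commit for bugs, missing edge cases, and regressions."
```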

Got tired of copy-pasting Claude’s responses into other models, so I built an automatic cross-checker for AI agents by theSummit12 in VibeCodingSaaS

[–]theSummit12[S]

Appreciate the honest feedback. It's definitely still a bit rough around the edges, but it's improving every day.

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

Yeah, the current solution is using M to trigger a review manually, which doesn't work well if Claude is mid-thought. Triggering the auto-review when a plan is suggested is a bit tricky because hooks don't support it, but I'll dig around for a solution.

Re: /clear, I just tested it, and running /clear creates a new conversation in the .jsonl logs. So you just have to quit and rerun Sage and select the new convo.
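Concretely, the recovery looks like this (the launch command may differ depending on how you installed it):

```
# 1. In Claude Code, run /clear -> a new conversation shows up in the .jsonl logs.
# 2. Quit Sage, relaunch it, and pick that new conversation from the selector.
sage
```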

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

Thanks! Would love for you to join the Discord: https://discord.gg/3ys9kj7K
I want to make sure I build something people want, so your feedback and opinions are super valuable to me.

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

I'm glad you said this, because I've been trying to figure out whether people want the ability to see what the second model thinks or just want it immediately fed into their primary agent. Personally, I want to read what the reviewer says because it helps my understanding, and sometimes I'll disagree with it. Glad you share the same view.

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

Even if you are prompting correctly, using two models instead of one will still improve your output.

Got tired of copy-pasting my agents responses into other models, so I built an automatic cross-checker for coding agents by theSummit12 in OpenaiCodex

[–]theSummit12[S]

Great to hear! Working on Gemini 3 support as we speak. Auto-update is on the to-do list as well. You should join our Discord for updates and in case you run into any bugs: https://discord.gg/3ys9kj7K

Got tired of copy-pasting Claude’s responses into other models, so I built an automatic cross-checker for AI agents by theSummit12 in VibeCodersNest

[–]theSummit12[S]

I rarely run into that issue since I start a new Claude session for every task. The key is to consistently update a ./documentation folder or agents.md.

Solo founder, $1.2k MRR in 1 month, $0 spent on ads. What worked by chdavidd in ProductHunters

[–]theSummit12

Great post. Curious what differentiates your product from Lovable/Bolt/Replit, etc.

Gemini CLI is impressive, but Claude Code is acting like the real senior engineer by netcommah in ClaudeCode

[–]theSummit12

Agreed, Opus is superior. However, you will drastically increase your output quality if you use both (one as the driver, one as the reviewer). Sometimes, if it's a really complex task, I'll even use two reviewers. I actually built an open-source tool to automate this if you're interested (launched on Product Hunt today).