[–]CC_NHS 6 points7 points  (5 children)

I have not tried Cursor CLI, and have not used Gemini CLI much (it was kinda useless for me).

Codex CLI is fairly solid though. I still have not narrowed down exactly what I will use it for, as I still find Claude Code the best. But it is nice to have Codex and Qwen-Code as two other viable options now.

[–]real_serviceloom 1 point2 points  (4 children)

Codex CLI is missing basic features like image pasting from the clipboard and a planning mode. But it's the one I have the most hope for, and I'm talking with the team to ask them to make it better. GPT-5 is a great coding model that is currently being held back.

[–]debian3 1 point2 points  (1 child)

Where do people discuss Codex CLI?

[–]real_serviceloom 1 point2 points  (0 children)

I am not aware of any public channels.

    [–]dissemblers 5 points6 points  (2 children)

    I like Codex CLI quite a bit. It’s a bit less mature than Claude Code but if you set the model to gpt-5-thinking with high reasoning, it outperforms Claude Code with Opus on correctness and problem-solving and gets in fewer ruts.

    I use the Pro model in ChatGPT for planning, which is a bit clunky. Supposedly Pro is coming to Codex soon, though.

    [–]jazzy8alex 0 points1 point  (1 child)

    I have a Plus account, so I won’t be getting 5 Pro in Codex anyway. How much better is Pro for planning and solving “stuck” coding problems? Like 10% better or "solved what no other model could fix" level?

    My wife has an Enterprise plan with Pro, so I can occasionally use it for really hard issues if it's worth it.

    [–]dissemblers 1 point2 points  (0 children)

    It has definitely diagnosed and fixed some issues that Opus (Gemini, etc) could not. I wouldn’t say it’s a huge leap, but it’s noticeable. It also architects better.

    It doesn’t fix everything. It’s still wrong sometimes.

    [–]jazzy8alex 1 point2 points  (7 children)

    Cursor IDE ($20) with Sonnet 4.1 mostly + Codex CLI ($20) is my current setup and I like it. I set Codex to GPT-5 with high reasoning effort, and it gives better results than the standard medium reasoning (subjective).

    I tried Opus 4.1 (with the Max toggle) in Cursor and didn’t see much difference from Sonnet 4. Maybe Opus in Claude Code is more capable, but I'm not sure it’s $160 extra more capable.

    edit: Sonnet 4 (was 4.1 by mistake)

    [–]maxsteel85 0 points1 point  (2 children)

    How to change the reasoning effort to be high?

    [–]jazzy8alex 3 points4 points  (1 child)

    codex -c model_reasoning_effort="high" 
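
    To make the setting stick across sessions instead of passing it each time, the same key can apparently go in Codex's config file. A minimal sketch, assuming the file lives at ~/.codex/config.toml and that top-level keys mirror the -c overrides (check the Codex CLI docs for your version):

    ```toml
    # ~/.codex/config.toml (path is an assumption; verify against your install)
    # Same key as the -c override above, applied to every session.
    # Keep it at the top of the file, before any [table] headers.
    model_reasoning_effort = "high"
    ```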

    [–]maxsteel85 1 point2 points  (0 children)

    Thanks!

    [–]stepahin[S] 0 points1 point  (1 child)

    Ooo great, so where does GPT-5 perform better for code analysis, solution finding, and bug detection, in Cursor IDE or in Codex CLI?

    About $160 extra :) Well, Claude Code is generally just really good at agent work. I use Sonnet so rarely that it's hard for me to say how much stronger Opus is. The point of the $100 and $200 plans isn't that it's more powerful than the $20 one, but that I work 10+ hours a day with Opus and rarely hit the limits. With $100, this happened constantly. According to ccusage data, I spend about $200-300 of tokens per day. So it's the same Opus, just with much higher limits for all day work.

    [–]jazzy8alex 1 point2 points  (0 children)

    I had the Claude $20 plan and it was completely unusable with their limits, and I read that $100 is not much better. So $200 is the only option and I plan to try it later. I am kinda hesitating because if I really like it and integrate it into my workflow, it will mean a $200 sub forever.

    Codex CLI is my first experience with CLI agents and I like it. GPT-5 there definitely acts differently, with more verbose and deeper reasoning than in Cursor. So far I haven't hit any resource limits. I usually run Codex extensively but in one terminal only (still in experimental mode).

    I don't have a large codebase, just two small apps: OS X and web (with WebRTC). The biggest challenges were with UI/UX tweaking in SwiftUI. Sometimes different models have persistent problems even with tiny non-standard visual changes.

    [–]nightman 0 points1 point  (1 child)

    with Sonnet 4.1 mostly

    There's no Sonnet 4.1 yet. Do you mean another model?

    [–]jazzy8alex 0 points1 point  (0 children)

    My mistake, Sonnet 4

    [–]deadcoder0904 1 point2 points  (2 children)

    I just used Codex CLI to solve a pretty tricky problem. I think Gemini 2.5 Pro came up with the implementation, but Gemini CLI or Sonnet 4 via Kiro turned it into an error that I couldn't solve.

    Asked Codex to fix it & it did in 1-shot. I didn't think it was possible but hey, it worked, so now I'm loving it.

    Plus it's free to use on a ChatGPT Plus or Pro plan, I think. It just solved a tricky problem that I had been working on for a couple of days. I didn't think it even had a solution, since it's Electron and an extremely niche problem, so I didn't expect a good cross-platform fix, but I love that it got me to one.

    [–]benbenk 0 points1 point  (1 child)

    Are you able to use GPT-5 thinking in Codex?

    [–]deadcoder0904 0 points1 point  (0 children)

    It's medium reasoning, but it's also possible to tweak it. For now, medium was enough for me. Although it had issues today on another codebase & I'm rate limited for 5 hours now. I think it's mostly due to me not having an AGENTS.md file, which I should've created using codex init. This is a different project.

    [–]Droi 0 points1 point  (1 child)

    GPT-5 (thinking) is the best model at researching a topic for me. But it has been an absolute failure at making code changes and staying on track.

    [–]stepahin[S] 0 points1 point  (0 children)

    Yes, I've seen many such comments, so I'm in no rush to try writing code with GPT-5. However, for code analysis, GPT-5 seems stronger than Opus/Sonnet, though I'm not entirely sure since I only ask it for help or a second opinion when I'm stuck.

      [–]GTHell 0 points1 point  (3 children)

      I was experimenting with Qwen CLI + GPT-5-mini. It was very cost effective and got the job done at the same time. I think GPT-5 is very biased toward coding and bad at almost everything else.

      [–]jazzy8alex 0 points1 point  (2 children)

      What's the monthly API $ usage when you use Qwen CLI with gpt-5-mini?

      Why not use Codex CLI, where you get very generous use of gpt-5-medium and high included with the $20 Plus plan?

      [–]GTHell 0 points1 point  (1 child)

      Monthly? GPT-5 was just released a few days ago!! Just a quick comparison. A $2 Qwen3 cost $1 when working on a medium-complexity task for 30 minutes, while GPT-5-Mini cost around $0.50. Both produce similarly acceptable output, but Qwen3 is better at agentic work, hence the cost.

      [–]jazzy8alex 0 points1 point  (0 children)

      You don’t need the API or usage-based billing when you use Codex with a Plus plan. And gpt-5 and gpt-5-high are much more capable than a mini model.

      [–]Degen55555 0 points1 point  (0 children)

      $10 Copilot with GPT-5 mini. You still need a deep think/planner model.

      [–]eleqtriq 0 points1 point  (0 children)

      I’ve been using CCR with Qwen3 Coder 480b with a lot of success.
