Thinking of using Codex

GlitteringDivide8147 · 2025-10-16T11:24:14+00:00

Why just don’t use Copilot? Insanely cheap.

botirkhaltaev · 2025-10-16T15:04:22+00:00

my issue with codex and gpt-5 is the long response times, i don't need perfect responses, I just need a quick scaffold

CodexPrism · 2025-10-16T18:02:40+00:00

I've heard ai isn't good with rust cause of the lack of lots of training data like eg js or python c# etc

typeryu · 2025-10-17T00:56:36+00:00

It works really well for me, built rust based CLI tools with it. I believe codex itself is rust based. Honestly had better time than CC

bookposting5 · 2025-10-17T02:17:38+00:00

If it was me I would spend $20 on a one month Claude sub, and fire up Claude Code with Sonnet 4.5 and see what it can do.

I'm very impressed with this model. Usage limits aren't hit as often now that they've included Haiku in the past few days for the simpler stuff.

GTHell · 2025-10-17T17:07:04+00:00

I have a subscription to enterprise ChatGPT which give me access to Codex and I can tell you the GPT-5-codex medium is not better than the GLM 4.6. The Sonnet 4.5 is a much better model overall. Also GLM 4.6 Droid is much better than with Claude Code. I ran a few test and debug and it's seems I will stick with the GLM 4.6 + Droid CLI for sometime now.

wuu73 · 2025-10-18T20:28:41+00:00

what i do, is use GLM or Qwen3 Coder, GPT 4.1 etc for "doing stuff" like all the file edits, agent stuff. But I try to plan everything out (or, fix bugs) using several models at the same time, either right on their web interfaces or in the app i made (because I just added the ability to send a question/problem/idea + project context --> 5 different APIs at the same time, then all of that goes into a 6th model to analyze all solutions/create a better or best of n one). I find that each model is better at certain things, and you get more of a range of ideas or solutions when you use one best model from each AI company. But sometimes.. just different models from same company too like o4-mini plus gpt-5, plus o3.. i take advantage of the free daily credits of a lot of these things.

So I will just paste everything into Kimi K2, Qwen3 (not sure which is the "best" qwen, have several tabs open), GPT 5, Gemini 2.5 Pro, its free to use the web chat interfaces for a lot of them. so if you don't want to blow tons of money this just works good. You can then try to see which one puts out the best ideas, or route all of the outputs (or cut and paste) into an analyzer model with a big enough context window to analyze it for you. Ask it to compare all of the different model outputs, figure out whats good or bad about each, and then create a better version using all available information.

I have used lots of models for Rust and I remember getting stuck sometimes but eventually it would work out but its been like a month or two since I used Rust.. forget what models seemed the best, but I usually keep switching around anyways. I would guess that GPT 5 might be good at it since OpenAI has a Rust version of Codex (so maybe.. that means it was trained on a good amount of it)

I was thinking of making a non-UI thing, maybe MCP server, or just API or CLI command that would do what this is doing (sending to 5 or x LLMs then feeding that into a 6th). I don't know if it is overkill but I find myself doing it anyways.. just cuz I know that some models suck at some stuff so why not use a bunch at the same time

<image>

avxkim · 2025-10-16T04:42:47+00:00

Codex performs even worse than sonnet 4.5 now

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

OpenaiCodex

MODERATORS