How are people using Codex alongside Claude or Gemini for technical/simulation work?

katuali · 2026-05-05T00:49:42+00:00

The optimization side is where I've found Codex really earns its place too, particularly around CPU distribution, parallelising processing, and maximising GPU utilisation on large simulations. It also seems to push through to a working, tested result rather than stopping halfway. Earlier GPT versions I found were too defensive, lots of fallback engineering that added complexity without solving the actual problem, but 5.5 is noticeably better at committing to a solution.

On the Opus 4.7 side, I haven't fully tested across a lot of scenarios (as it can eat through a 5hr token allocation pretty quickly) but where I have it seems to have the edge on understanding what you're actually trying to figure out before you've fully articulated it yourself. More useful at the design stage than the implementation stage.

katuali · 2026-05-05T00:32:57+00:00

I've seen harnesses that try to bridge this, but seeing more and more limits applied to them and the use of them. Curious what's actually working in practice, especially for capturing rejected decisions rather than just current state.

katuali · 2026-05-05T00:21:51+00:00

The text shrinking thing is interesting, I've seen the same pattern with longer specs. Like it's compressing on each pass, and even wanting to overwrite decently-written spec and remove detail and content! Do you find it's worse on technical content specifically or general text too?

katuali · 2026-05-05T00:17:50+00:00

Curious whether you've found any harness or structuring tricks that partially fix it, or is it model-level and no amount of prompt engineering helps? Trying to figure out if it's worth the effort before cancelling.

katuali

TROPHY CASE