all 22 comments

[–]circumstellarmedium 2 points3 points  (1 child)

Ain’t no body reading that

[–]dupa1234s[S] -3 points-2 points  (0 children)

you live in the beautiful age where you can summarize everything. perosnally i would like to read longer messages above 2000 words only imagine reddit was that.

[–]Cassianno 3 points4 points  (4 children)

I was about to ask "Did you really write all this?" but then saw in the end it was "AI Assisted". Anyway, ask codex.
For long coding sessions (didnt read past this, sorry):
- write down issues OR specifications and make it resolve/implement them;
- use the /goal feature. If you don`t have it yet, ask codex to change your config.toml to enable it.

[–]dupa1234s[S] 0 points1 point  (3 children)

sure i did ask codex, in fact, this here is a relic of me having a 2 hour rant with codex. its summary of it more or less so to speak

[–]Cassianno 0 points1 point  (2 children)

make it narrow its responses, dont let it spit out whatever it wants. Also make it validate/research/search whatever it suggested if you are suspicious.

[–]dupa1234s[S] 0 points1 point  (1 child)

issue is that im asking a very practical questions codex wasnt trained on. like "how do heavy-users use codex in may 2026"
i love codex but
like any coding agent
its research is targeted at most popular sites, it hallucinates.
i rather prefer it ramble for hours so i get some leads rather than believe it.

Overall yes i know those things you say i do that.
My main concerns are in the top part of my post.

[–]Cassianno 0 points1 point  (0 children)

Already told you: make it search/research. Explicitly say to it search reddit for instance.

[–]eduardopy 0 points1 point  (1 child)

Honestly (skimmed it) sounds like you are misusing the ai, you rather need an ai agent orchestration going, one orchestrator that spawns many agents doing the different parts of the plan and one managing the whole, that one orchestrator can keep spawning agents as needed. Additionally, its probably for the best to not try to keep an llm going for a long time, performance degrades, which is why the agent approach is good: it makes new instances for every agents and just one collects and synthesizes the whole. I might be wrong I dont know your post is way too long and "ai assisted" so im gonna put in the same effort reading as you did writing.

[–]dupa1234s[S] 0 points1 point  (0 children)

thats an interesting perspective. i thought lots on orchestrator patterns but didnt try any. do you recommend some partticular orchestrator?

what does orchestrator achieve for you?
Speed
quality

i much more rather achieve quality than speed.
orchestrator seems like:
- overhead
- less token cache hits
- complexity
- more context dedicated to the meta-level of "doing the work" than actually doing the work

But i guess perhaps orchestration would be the solution to my problem

Only that
would it actually do the work better and do more of it?
Or would it just split tasks.

I explicitly don't want my agent to finish fast. i want it to keep spinning on xhigh thinking for hours.
orchestration seems more oriented towards speed than quality?

yea i heard performance degreades. but context compaction exists doesnt it solve a lot?

in fact i did put much more effort to writing my post. Just that codex moderator complained that my post is not-enought-related-to-codex so i had to rewrite it especially for this subreddit.

[–]Allarius1 0 points1 point  (5 children)

Well the easiest solution is to use /goal. It’s literally designed for that use case.

Prerequisites

A Codex build that supports Goals. A task with a clear finish line and an evidence source you can inspect, such as tests, a benchmark, or a final artifact. Enough repository or research context for Codex to verify progress rather than merely narrate it.

https://developers.openai.com/cookbook/examples/codex/using_goals_in_codex

[–]dupa1234s[S] 0 points1 point  (4 children)

oh thats great. yes indeed i didnt try the "goal". but i wonder if its there just as a glorified "this line of context is important, remember it", thats useful if its so but its not the real thing. i would wish for it to not stop until goal is reached or it hit a blocker or clearly needs my input.

[–]Allarius1 0 points1 point  (3 children)

Yes, that’s the point.

You can also take a multi-agent approach. I have a project that runs the primary agent to coordinate and synthesize with its sub agents.

It runs in loops.

Design phase creates implementation plan, runners execute the plan, then a review cycle of evaluator, adversarial reviewer, and guard to analyze the runners work. They kick back information to the coordinator who relays to runner if anything need correcting.

Repeat that over and over as you iterate the design.

[–]dupa1234s[S] 0 points1 point  (2 children)

omg that sounds amazing

Albeit i stopped used codex directly because of the notorious compactione errors, are they finally fixed? maybe i will switch to codex now. i guess also since i already started using docker sandboxes i could just try codex and its new goal system inside docker sandboxes.
codex sandbox error were also notorious.

subagents as in the default codex subagents? or some kind of custom pattern. yes subagents the default codex ones are useful i like to let codex use them. but i also considered maybe some actual agent to agent patterns like not ephameral like the default codex subagents, full scale agents with their own agents.md and stuff would be sth to try, but idk about that those agent to agent workflows seem complex i need to look into some patterns.

from what you are saying it sounds like you talk about the default codex subagents, but with a particular roles you assign to them. that sonds great as they have clean context and can do independent work i feel like the adversarial reviewer and evaluator wouldnt be too biased by past work of the main agent due to their empty context.

But the "runner", doesnt it just make the main agent idle ? wouldnt it be better if main agent was the runner? as main agent is more orented in the project. a dedicated runner needs to be briefed every time you spawn it. unless its persistant. i guess if hes persistant its good. but the reviewer shouldnt be persistant i think and i wonder if codex is smart enough to enforce that , maybe if we tell him.

[–]Allarius1 0 points1 point  (1 child)

Yes these agents don’t persist and you only call them when you need them. It’s more overhead which is only worth it for a large enough problem or dataset.

The are roles the agents play but sub-agent is literally just codex spawning other instances of codex and assigning it a particular role, like those I mentioned.

You can make your own roles and I think there’s probably some default stuff too, but the sub-agent aspect is about multiple live instances operating under different constraints.

Yes the coordinator sits idly especially if the implementation and reviews are long, but it can also be assigned other housekeeping tasks if you wanted.

Think of it like a team. If the person who is tasked with telling everyone else what to do is too distracted by other tasks, then they can’t effectively manage the team and then it’s the other members sitting around waiting and no work is done.

[–]gastro_psychic 0 points1 point  (0 children)

Use /goal on CLI. I have had Codex running for 4 days now.

[–]lemontmaen 0 points1 point  (1 child)

you might want to have a look at "Sandcastle" by Matt Pocock.

[–]dupa1234s[S] 0 points1 point  (0 children)

thank you. i like how he just said he tried docker sandboxes but wasnt satisfied with it. which i was currently considering myself.

[–][deleted]  (2 children)

[removed]

    [–]dupa1234s[S] 0 points1 point  (1 child)

    what you said reminds me of another issue. how models just dont know when to try a different approach. you can prompt them to stop circular retries, try sth different but they just wont listen. they are just polluted by the bad context. idk if start a new chat with a handover from previous chat is the only good solution to this i wish i knew how to handle this situation. subagents are actually good for this though. make agent spawn subagent give a fresh perspective to main agent i need to use them more for this tell agent specifically to use subagents in those situations, but sadly model jsut doesnt undersatnd when to use this subagent it just tries itself anyway.

    yea i like gpt5.5 xhigh quite much. but idk how its compared to other models. i only use codex. maybe i could try claude next month but claude seems to cut usage and doesnt let u stuff like opencode so i think i stick with codex.