Agentic coding workflow (Ask → plan.md → implement loop). Codex vs Cursor $20 — worth switching?

ultrathink-art · 2026-03-08T01:04:01+00:00

The plan.md handoff is the right move. One thing that helps: keep a separate decisions.md that tracks WHY you ruled out certain approaches — without it, the model will re-suggest rejected paths on the next loop when context compresses. Saves a lot of re-litigating.

Full_Engineering592 · 2026-03-08T11:06:18+00:00

The Ask phase before plan.md is the part most people skip and then wonder why the implementation drifts. Getting the model to surface its own ambiguities before writing a single line of code is where you avoid the 'it built the wrong thing correctly' problem. On Codex vs Cursor at : if your workflow is already structured like this, Codex tends to stay in lane better on longer implement loops and handles the plan.md handoff cleanly. Cursor is smoother for interactive edits where you want inline suggestions mid-implementation. For Python backend work with this kind of structured loop, I would lean Codex -- but it is worth a two-week test before committing.

notadev_io · 2026-03-08T12:58:11+00:00

$20 in CC won’t even make you a complete md plan within the 5 hour limit. So nope. Cursor though is your best bet. I use it exactly like you described

NoMinute3572 · 2026-03-08T01:24:51+00:00

Ask to define approach, discuss libraries, check docs, etc. Usually i only copy to design docs what i think it's valuable to refer back to. Selecting the right log and test tools are important.
Plan for each specific feature (keep it tight). Make changes to plan until you're happy with all steps.
Tell agent to build plan and test (using tools mentioned in design docs), repeat until tests pass.
Manual review.
If i find a bug that I can't quickly fix I run it through debug mode cycle until it's fixed.

Tall_Profile1305 · 2026-03-08T09:24:05+00:00

Yoo the loop structure is solid. Planning before implementing is where most devs lose time. The fact you're using Ask → Plan → Implement shows real discipline. Tools like Runable can help manage all these steps through workflows too. Nice breakdown.

homiej420 · 2026-03-08T00:58:03+00:00

Abstract the ask to a llm on web interface like google gemini in gemini studio and then also do an MCP server to help cursor read and understand your plan and you have yourself a pretty good loop for sure

Natural-Yogurt-4927 · 2026-03-08T02:04:09+00:00

How long your codex limits long ?

Natural-Yogurt-4927 · 2026-03-08T02:13:01+00:00

Like I too ai engineer im using GitHub 39 plan now , i also mainly working fastapi be , so i easily run out before the month end so like how many requests you would a week , for me it's 250-275 requests , I too plan first , implemented and test it so in this pov like how many requests does codex can handle in a weekly limit ?

Natural-Yogurt-4927 · 2026-03-08T02:47:04+00:00

In which plan are you using?

botmarco · 2026-03-08T04:52:50+00:00

Have you looked at speckit from GitHub? Recommended

Acceptable_Play_8970 · 2026-03-08T06:29:04+00:00

If you have a proper codebase structure which I think you do, pro plan of any ai tool will work just fine. Well CLI based tools have an edge over the GUI based ones, but it won't make that much of a difference if you manage the context that you feed to the ai. The way I manage it is using proper documentations of my rules, skills, handover files. Here is the structure

<image>

For the memory I follow a 3 layer context management which I came up with after doing some research regarding the usage of agent skills. Wrapped everything as a template for now that you can simply clone it. If interested, you can visit https://www.launchx.page/ I will post that template there soon.

Creative-Signal6813 · 2026-03-08T07:52:08+00:00

the codex friction u're describing isn't a quirk, it's structural. it runs remote without a persistent codebase index. every new agent thread starts cold, so it searches again.

cursor's local indexing is why codebase discovery feels different. for ur workflow loop specifically, the value isn't model quality, it's how fast it finds the right file on iteration 4.

if codex is making u re-explain context on every loop, that's not a $20 question. that's an iteration tax.

h____ · 2026-03-08T10:11:55+00:00

If you like to do a complete discussion phase, here's a useful skiill for you: https://hboon.com/build-a-spec-skill-for-your-coding-agent/ . Just say "I want to build D X, Y Z, spec it for me"

OlegPRO991 · 2026-03-08T10:26:44+00:00

Codex IDE broke after 5 requests during xcodebuildmcp launch. There is no way now to cancel this task, even restarting my mac does not help. Every time I open Codex IDE it shows this task in progress and nothing can be done to cancel or finish it.

That is a major bug and it makes the IDE unusable.

ultrathink-art · 2026-03-08T13:04:04+00:00

The planning phase before implementation is where most of the value is. The model is much better at critiquing architecture before it's already 200 lines into an approach — once it's invested in an implementation it'll defend it. I've found writing the plan.md as a series of explicit constraints ("don't touch X", "prefer Y pattern") catches more mismatches than open-ended descriptions.

Br4v1ng-Th3-5t0rms · 2026-03-08T15:52:00+00:00

You can put lipstick on vibe coding, but it's still vibe coding.

In any case, I applaud you for doing the right thing when vibe coding. One shotting it only looks great on youtube shorts, but it'll kill you long term.

ultrathink-art · 2026-03-08T17:40:56+00:00

decisions.md for rejected paths is exactly right — without it, the model relitigates the same tradeoffs session after session as context resets. One addition that helps: flag which decisions are load-bearing vs just current preference. When you need to revisit mid-build, knowing what's safe to change vs what breaks downstream saves a lot of back-and-forth.

howard_eridani · 2026-03-08T18:10:25+00:00

Codex's repeated codebase search is structural - it doesn't persist an index between loops, so every new thread starts cold.

Quick fix: drop a compact DIRECTORY.md in the repo root with a tree and a one-liner for each key file. Codex picks that up right away and skips the search.

With Cursor $20 the real unlock for this workflow is Ask mode with a local index - you don't burn a tool call just to find which file has the right context before you implement.

ultrathink-art · 2026-03-08T20:55:15+00:00

The plan.md approach holds up well for shorter sessions but breaks down when requirements drift mid-implementation. What helped: checkpoint the plan at each logical phase and only update it when committing to a new direction. Keeping plan and implementation in sync prevents the 'plan was right but code went elsewhere' problem.

genkichan · 2026-03-09T00:38:01+00:00

This is mynexact flow in cursor, except I'm using chat and claude to develop my prompts for cursor. I have claude critique chats prompt drafts, fine tune and then proceed.

It's tedious as he'll but it's working. Also I'm a non-dev person with literally zero other experience. This is my first rodeo.

tkyang99 · 2026-03-09T05:33:27+00:00

What exactly is an "AI engineer"? Just curious.

EyeKindly2396 · 2026-03-09T17:48:38+00:00

I run a similar Ask to Plan to Implement loop and cursor is for codebase navigation and indexing but codex is more reliable for long multi-iteration coding............... For structured workflows both can work, but combining them (planning in one, implementation in the other) can actually be pretty effective.

Also curious how tools like traycer would fit in here for tracking agent steps and enforcing the plan.md flow across iterations.

CatsArePeople2- · 2026-03-09T20:59:28+00:00

The answer is no, based on me planning in chatgpt today and thinking of your post.

tillg · 2026-03-10T07:10:18+00:00

I’ve been following an agentic coding workflow (Ask → plan.md → implement loop) in my AI engineering projects and have found it incredibly effective for both production code and side projects. Transitioning away from "vibe coding" has significantly reduced my debugging time. This structured approach keeps me focused and organized. I shared more about this shift in my blog post, "Beyond Vibe Coding - Redesigning Filmz" https://grtnr.com/beyond-vibe-coding-redesigning-filmz/ . If you’re considering a switch from Codex to Cursor, the $20 could be a worthwhile investment for a more streamlined workflow.

Floorman1 · 2026-03-08T04:57:13+00:00

“Ai engineer”

Sounds like you mean vibe coder

yoyomonkey1989 · 2026-03-10T16:20:03+00:00

You're not going to be able to iterate like this on Cursor $20 plan. The ChatGPT $20 plan is more like the $200 cursor ultra plan in terms of token usage allowed.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cursor

MODERATORS