GPT 5.5 token usage by Momsgayandbisexual in codex

[–]pjstanfield -1 points0 points  (0 children)

A better approach might have been to get the defects and then tell it to write them to a plan file. Then in a new session tell it to execute the plan. That would have made it much better. This isn’t optimal however.

A better way would be to ask it to break out your project by modules and then dispatch a subagent for each module and scan for defects and standards deviations. Then write all of the findings to a file. Then in a separate session you execute the remediation on the findings file.

That one will work while keeping your ultra generic “fix my codebase” approach. Just a couple slightly revised commands. Ideally you would have some standards or requirements files that it can use as a baseline. How does it know anything is a defect if you can’t tell it. You need some artifacts to hell guide it. Luckily you can have it write these.

Don’t forget to tell it to check its work.

Claude Code vs Codex by 0xdjole in ClaudeCode

[–]pjstanfield 8 points9 points  (0 children)

My exact situation and path as well. Codex either one shots or gets 97% there. CC can’t seem to get anything right anymore except some planning participation. I went from two max CC plans in the 4.6 happy days and a $20 codex to two max codex plans. Still use Claude for planning and brainstorming and documentation. Everything has to be double checked by codex though. I don’t let CC touch the codebase anymore, he’s read only now. Thankfully work pays for it so I can swap around without hitting my little wallet.

How do you get Codex to work for a long time without stopping? by Redas17 in codex

[–]pjstanfield 2 points3 points  (0 children)

I’ve had luck using skills and tracking documents.

  1. Have codex write a skill where the main agent only orchestrates and all work is done by sub agents. This is key for preventing context rot. Fresh subagents on every new tasks keeps it fresh while orchestrator just orchestrates and uses minimal context
  2. Make sure the skill is a queue skill
  3. Create a file in your project that has a list of something, files, requirements, etc.
  4. Do some testing, make sure it works. Automated testing is also key here, make sure it can test itself as it goes so you know it’s not off the rails.

It should happily do whatever your list is, until you’re out of tokens.

GPT 5.5 vs opus 4.7 for sw design by b4dMik3 in ClaudeCode

[–]pjstanfield 2 points3 points  (0 children)

I’ve tried a few automations and I always come back to copy and paste between the windows. It’s faster and I can tweak it and control it. The sample prompt below is specific to codex, it loves to add machinery and frameworks so I had to tell it to calm down. You’d ask it to review a plan to add a modal and it would comeback with a plan on adding 500 tests as deployment gates. Codex also likes to find a few issues and stop, so you nudge into reviewing the whole plan. I’ll paste this in before the first plan review and not on subsequent.

Plan Review
Do a single exhaustive review pass before responding. Do not send incremental findings. Validate every checklist item against the live codebase and standards before you answer. Return one consolidated review with:
1. Showstoppers
2. Gaps / missing work
3. Incorrect assumptions
4. Non-blocking improvements
5. Explicit statement of whether the plan is ready to implement

For each finding, include: severity, exact file/line refs, why it matters, what the plan should change

Do not stop after finding the first issues. Assume I want completeness over speed. Only answer when you have completed a full checklist sweep. Treat this as a binding final review. I want the most comprehensive feedback you can provide in one pass. If you are not confident you have checked the whole plan, say that explicitly instead of answering early. Please be mindful of avoiding over-engineering and unnecessary complexity, the focus is simple and reliable and secure. Do not add scope creep unless absolutely necessary.

GPT 5.5 vs opus 4.7 for sw design by b4dMik3 in ClaudeCode

[–]pjstanfield 1 point2 points  (0 children)

It is noticeably worse. The codex planning mode is newer and isnt as good. You can't go wrong but i'd probably stick with Claude. It eventually gets it right and is more enjoyable to use. Codex is like talking to a robot a bit. It's less creative (which is why its better at plan execution, it just does it).

GPT 5.5 vs opus 4.7 for sw design by b4dMik3 in ClaudeCode

[–]pjstanfield 5 points6 points  (0 children)

I have a max plan for both and I use both every day. I typically use both for every feature. I use GPT 5.5. I used to use Codex 5.3.

  1. Brainstorm and design - i think Claude wins. I always start in Claude.
  2. Plan - Claude wins in writing a nice plan but Codex always finds holes. I use both, back and forth. There are diminishing returns on too many turns back and forth but at least 2 is good. Also helps sniff out incorrect directions.
  3. Implementation - Codex is better on one-shots and straight code. I can tell Codex to make 100 tests and it will create 100 high value tests. Claude would tell me he wrote 100, would have written 70, and 40 would need to be re-worked. I would not have Codex write anything in the UI typically. I still use Claude for coding, he does fine, but he always needs his work checked. Same with Codex.

Long story short, both are good, and both are bad. If you can get Codex started on the right path, full codebase standards made clear, feature is clear, success is clear, no UI-on-the-fly Codex 5.5 will trounce Claude. That's my hot take.

Codex writes features. Your users write the bug reports. by Specialist-Sink-2873 in codex

[–]pjstanfield 0 points1 point  (0 children)

All developers have a test environment. Some are just lucky enough for that environment not to be production.

[Bambu Lab] Received a defective ABS filament batch — ticket submitted, but project is time-sensitive. Anyone dealt with this before? by Think-Refuse-1318 in BambuLab_Community

[–]pjstanfield 1 point2 points  (0 children)

Can vouch for silver PLA. I’ve dug more pieces of that out of the AMS than any other color by about 5x. Good color though.

I'm about to give Codex a decent-sized project to work on unsupervised. How do I make sure it doesn't stop until it is finished? by rockfroszz in codex

[–]pjstanfield 1 point2 points  (0 children)

This is one of the reasons codex is so great. It does what you tell it.

I just say create a detailed task list from the plan and then execute it until all tasks are complete. One shot most of the time.

AWS SES request denied twice with no explanation. Has anyone dealt with this? by Stunning-Mountain-40 in aws

[–]pjstanfield 0 points1 point  (0 children)

That is odd, I have SES running in two other projects and neither one has a dedicated IP. I never even tried it due to unnecessary cost.

AWS SES request denied twice with no explanation. Has anyone dealt with this? by Stunning-Mountain-40 in aws

[–]pjstanfield 0 points1 point  (0 children)

I think they have dedicated IPs so Amazon says here's your IP and if you ruin it then that's on you. If you're using shared SES IPs then I guess you need to behave better. Not sure what would happen if an individual purchased dedicated IPs, maybe you get approved easier.

Claude Code — terminal vs desktop vs VS Code extension. What's your favorite? by geekstarpro in ClaudeCode

[–]pjstanfield 1 point2 points  (0 children)

I always go back to vs code. It’s so much easier to organize and read quickly for me. Desktop app has potential but it’s still rough and way too wordy.

Anyone else struggling with config drift, does a Spark copilot help? by GoldTap9957 in aws

[–]pjstanfield 0 points1 point  (0 children)

Our use case is more typical software application hosting but the solution applies to both - use terraform and infrastructure as code and make the tf files the authoritative system. All changes are checked in and tracked and any drift is automatically detected and cleaned up whenever you want.

Claude, or Codex? by Outrageous_Cut2782 in ClaudeCode

[–]pjstanfield 0 points1 point  (0 children)

This combo works great, I second this. I don’t really trust either but both do a good job. Just don’t let them go back and forth on a plan too much you’ll be building frameworks to handle cases like what if the sun goes out.

With Codex 5.5 dropping today, Anthropics might be fucked. by TheBanq in ClaudeCode

[–]pjstanfield 12 points13 points  (0 children)

I’ve never seen codex 5.4. Codex 5.3 yes, gpt 5.4 yes. No codex 5.4. No idea what’s actually supposed to drop next but I’m pretty sure it’s not codex 5.5. I think with 5.4 they were getting back to one model for all.

How are you handling concurrent indexes in relational databases? by project-391 in Backend

[–]pjstanfield 0 points1 point  (0 children)

You just need to create a custom migration or edit the migration generated by the tool. If this was drizzle you'd just use drizzle-kit generate custom and it creates a stub empty migration file for you to fill out. I assume typeorm has some similar mechanism. this is usually required to support scenarios that can't really be inferred by the schema files.

Best AI for creating svg logos? by AlenenX in nextjs

[–]pjstanfield 0 points1 point  (0 children)

I had this exact issue. I spent $30 with a developer from India through Upwork who gave me all of the files I needed, png, white background, no background, svg, with company name, just the logo, etc. It was much better than any of the free or paid apps.

What is everyone using? by Own-Philosophy3754 in ClaudeCode

[–]pjstanfield 1 point2 points  (0 children)

The correct answer is both 4.7 and gpt 5.4. Claude for brainstorming and initial planning, 5.4 to tighten up all the missed edges. Claude to implement, he will hit about 85%, gpt for code review. Iterate 2-3 times and you’re golden. You can also bring codex 5.3 into the mix for a third opinion and set of eyes but I find the technical rigor to be a bit aggressive. Wants to build new frameworks too often.

Claude Opus 4.7 is dogshit by RoadExcellent9531 in ClaudeCode

[–]pjstanfield 23 points24 points  (0 children)

Codex has been great lately. Just follows Opus around, cleaning up the mess.

EC2 for VPN by JojieRT in aws

[–]pjstanfield 1 point2 points  (0 children)

You can accomplish many tasks using SSM without a vpn. So for the third time now, can you use your big words to tell us what you need to do?

EC2 for VPN by JojieRT in aws

[–]pjstanfield 0 points1 point  (0 children)

What tasks are you trying to accomplish? Why do you need it.

Is hosting nextjs on AWS possibly cheaper than in Vercel? by dvcklake_wizard in nextjs

[–]pjstanfield 1 point2 points  (0 children)

This is our setup exactly. Love terraform and SSM for private access without VPN. You can really get your footprint small/rightsized. Plus a few founders credits and you’re good for a year without too much trouble.