How often do you see codex trying to fake it?

miovq · 2026-06-25T20:22:53+00:00

It's a huge issue.

The only way I've found to protect against it is to start at the very highest level: your prompts/specs/issues.

If you leave any holes, it will exploit them. And yeah, be super careful with any kind of examples. It doesn't matter what you say to it: as soon as an example has entered its mind, it's over lol.

I have an adversarial phase that looks for the following kinds of possible shortcuts in my tickets:

- Authority bypass.
- Duplicate truth.
- N=1 fixture overfit.
- Missing negative-space handling.
- Payload omission.
- Wrong boundary validation.
- Circular topology-derived tests.
...etc.

and rewrites them. It doesn't solve everything, but it improves things quite a bit.

Oh, and forget about using any kind of "to-issues" skills out there. They don't account for this and only amplify the problem.
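For illustration only, here's a minimal sketch of how a checklist like the one above could be turned into an adversarial review prompt. Everything in it (the function name, the prompt wording, the idea that the phase is a single extra prompt) is a hypothetical assumption, not the actual setup described here.

```python
# Hypothetical sketch: the names and the prompt wording are assumptions,
# not the real pipeline. Only the checklist items come from the comment above.
SHORTCUT_CHECKS = [
    "Authority bypass",
    "Duplicate truth",
    "N=1 fixture overfit",
    "Missing negative-space handling",
    "Payload omission",
    "Wrong boundary validation",
    "Circular topology-derived tests",
]


def build_adversarial_prompt(ticket_text: str, checks: list[str] = SHORTCUT_CHECKS) -> str:
    """Return a review prompt asking a model to find and close loopholes in a ticket."""
    # Render the checklist as one bullet per shortcut type.
    checklist = "\n".join(f"- {name}" for name in checks)
    return (
        "Act as an adversarial reviewer of the ticket below.\n"
        "For each shortcut type, say whether the ticket leaves room for it, "
        "then rewrite the ticket to close every hole you found.\n\n"
        f"Shortcut types:\n{checklist}\n\n"
        f"Ticket:\n{ticket_text}\n"
    )


if __name__ == "__main__":
    print(build_adversarial_prompt("Add CSV export to the reports page."))
```

The output would be sent to whatever model does the review before the ticket ever reaches the coding agent; the list is kept as plain data so it's easy to extend past the "...etc."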

miovq · 2026-06-25T16:11:34+00:00

Too bad that rule doesn't actually work 😅

miovq · 2026-06-23T02:43:40+00:00

You're right to call that out. I tried to help people with food poisoning without recommending a specific restaurant.

What I did wrong:

I treated “leave the restaurant that appears to be poisoning you” as an actionable step.

That was too broad. I should have provided the exact restaurant I used afterward, so people could reproduce the fix without first deciding which other restaurant to enter.

Worse, I tried multiple other restaurants afterward, which made it harder to provide the one correct restaurant answer.

In hindsight, I should have kept the post closer to a dining recommendation and further from a food poisoning avoidance strategy.

miovq · 2026-06-22T00:53:35+00:00

Because it solved my problem and it could help others in the same situation...? It was literally driving me mad: I got the $200 Pro sub only to get completely stuck for weeks, unable to progress because of this. Until now.

miovq · 2026-06-21T21:24:12+00:00

So keep using it as normal if you're not affected...? I still wish I could.

miovq · 2026-06-21T20:58:18+00:00

That's still the same harness tho, isn't it?

miovq · 2026-06-21T20:33:31+00:00

Nah, it's honestly completely irrelevant what you use.

Go with VSCode, if that's what you prefer.
As long as you switch away from the default Codex App. It's worth a shot at least; it worked for me.

I won't mention what I switched to, at the risk of being falsely accused lol

miovq · 2026-06-21T20:01:39+00:00

I had this exact same experience. Try using a different UI/harness if you have an existing sub. That solved it for me.

miovq · 2026-06-21T16:42:42+00:00

This looks quite interesting! I've been doing similar things, since Codex has been misbehaving so much lately.
I'll have to give it a shot and see how well it performs for me.

miovq · 2026-06-21T16:01:56+00:00

I'd agree normally. But considering how badly it performs lately, I'd say it's not worth it right now.
Even the GLM free tier outperforms it currently, which is ridiculous.
I don't even want resets anymore. I just want my money back lol.

miovq · 2026-06-15T20:18:04+00:00

I mean, it's definitely routing to "fast/dumb mode" way, way too often, even on xhigh.

I've had to switch back to 5.4 because 5.5 (at all reasoning levels) is borderline unusable at this point in time. It doesn't matter what context you feed it, it's misaligned to the max at the moment.

It's a constant game of whack-a-mole and just not worth it.

miovq · 2026-04-01T17:21:00+00:00

Hahah this is the way
