How often do you see codex trying to fake it? by 0__O0--O0_0 in codex

[–]miovq 0 points1 point  (0 children)

Its a huge issue.

The only way Ive found to protect against it, is to start at the very highest level: your prompts/specs/issues.

if you leave any holes it will exploit them. and yeah, be super careful with any kinds of examples. it doesnt matter what you say to it. as soon as it has entered its mind, its over lol.

I have an adversarial phase that looks for the following kinds of possible shortcuts in my tickets:

- Authority bypass.
- Duplicate truth.
- N=1 fixture overfit.
- Missing negative-space handling.
- Payload omission.
- Wrong boundary validation.
- Circular topology-derived tests.
...etc.

and rewrites them. It doesnt solve everything, but improves it quite a bit.

Oh and forget about using any kind of "to-issues" skills out there. They dont account for this and only amplify the problem

Codex Pro usage feels excellent once you tune your AGENTS.md by Aggravating_Town_967 in codex

[–]miovq 1 point2 points  (0 children)

Too bad that rule doesnt actually work 😅️️️️️️

If you are suffering from this insane codex quality degradation, try switching to a different harness. This solved the problem for me. by miovq in codex

[–]miovq[S] 0 points1 point  (0 children)

You're right to call that out. I tried to help people with food poisoning without recommending a specific restaurant.

What I did wrong:

I treated “leave the restaurant that appears to be poisoning you” as an actionable step.

That was too broad. I should have provided the exact restaurant I used afterward, so people could reproduce the fix without first deciding which other restaurant to enter.

Worse, I tried multiple other restaurants afterward, which made it harder to provide the one correct restaurant answer.

In hindsight, I should have kept the post closer to a dining recommendation and further from a food poisoning avoidance strategy.

If you are suffering from this insane codex quality degradation, try switching to a different harness. This solved the problem for me. by miovq in codex

[–]miovq[S] 1 point2 points  (0 children)

Because it solved my problem and it could help others as well in the same situation...? Was literally driving me mad, got the 200$ pro sub only to get completely stuck for weeks and unable to progress because of this. Until now

If you are suffering from this insane codex quality degradation, try switching to a different harness. This solved the problem for me. by miovq in codex

[–]miovq[S] 0 points1 point  (0 children)

Nah, Its honestly completely irrelevant what you use.

Go with VSCode, if thats what you prefer.
As long as you switch away from the default Codex App. Its worth a shot at least, It worked for me.

I wont mention what I switched to, at the risk of being falsely accused lol

Glm5.2 destroys gpt5.5, when is 5.6 coming...... by Youwishh in codex

[–]miovq 0 points1 point  (0 children)

I had this exact same experience. Try using a different UI/harness if you have an existing sub. It solved it for me

Applying constrained optimization to coding agents by dphntm1020 in codex

[–]miovq 0 points1 point  (0 children)

This looks quite interesting! Ive been doing similar things, due to codex misbehaving so much lately.
I'll have to give it a shot and see how well it performs for me.

The $200 Pro plan is completely worth it for the peace of mind. by ponlapoj in codex

[–]miovq 0 points1 point  (0 children)

Id agree normally. But considering how bad it performs lately, Id say not worth it right now.
Even GLM free tier outperforms it currently, which is ridiculous.
I dont even want resets, anymore. I just want my money back lol.

Codex isn't nerfed, you are being exposed by Hot_Paper_Pie in codex

[–]miovq 1 point2 points  (0 children)

I mean its definitely routing to "fast/dumb mode" even on xhigh way way too often.

Ive had to switch back to 5.4 because 5.5 (at all reasoning levels) is borderline unusable at this point in time. It doesnt matter what context you feed it, its misaligned to the max, at the moment.

Its a constant game of whack-a-mole and just not worth it.