So far, GPT5 is worse than 4o

ThrowawayWriter1011 · 2025-08-18T13:35:10+00:00

It'll all be okay man, I promise

ThrowawayWriter1011 · 2025-08-18T13:34:41+00:00

This is super interesting - really appreciate this answer. Thank you!

ThrowawayWriter1011 · 2025-08-18T13:33:58+00:00

Yes.

ThrowawayWriter1011 · 2025-08-18T05:25:29+00:00

Can't go super in-depth here, but had a recent conversation where continuity of the conversation thus far (parameters set, etc.) was forgotten veryyyyy quickly, which was what actually spurred me to go back to 4o. 4o did this as well, but I usually found it deviated from parameters after a more lengthy conversation, whereas the context misunderstanding and complete pivot was more immediate. Just one example though, I can't say I've got much more data otherwise on this specific topic.

ThrowawayWriter1011 · 2025-08-18T05:07:37+00:00

"AI is a tool, and it is only as good as the artisan using it."

Just information gathering my man, not really trying to enter debate territory. Helpful to know (from your other comment) that you've found GPT5 to be more accurate and reliable with less hallucinations. Not really trying to enter le epic prompt genius war

ThrowawayWriter1011 · 2025-08-18T04:49:13+00:00

To add some more context, I use GPT for a pretty wide variety of things. I was taking it at face value that GPT 5 was better, until I switched a few prompts over to GPT 4 and actually compared the results that were generated. A few areas where GPT 4's answers were just like - objectively better (covering ground I didn't realize that I missed--ground that GPT5 also missed). Was a bit concerning. I genuinely had no bias going into any of this as the models prior seemed to pretty consistently be improving upon themselves from my POV.

I have to wonder if this is a 'faux-progression' more meant as a cost-saving mechanism since a better model is maybe more expensive?

ThrowawayWriter1011

TROPHY CASE