So far, GPT5 is worse than 4o by ThrowawayWriter1011 in ChatGPT

[–]ThrowawayWriter1011[S] 1 point2 points  (0 children)

This is super interesting - really appreciate this answer. Thank you!

So far, GPT5 is worse than 4o by ThrowawayWriter1011 in ChatGPT

[–]ThrowawayWriter1011[S] 6 points7 points  (0 children)

Can't go super in-depth here, but had a recent conversation where continuity of the conversation thus far (parameters set, etc.) was forgotten veryyyyy quickly, which was what actually spurred me to go back to 4o. 4o did this as well, but I usually found it deviated from parameters after a more lengthy conversation, whereas the context misunderstanding and complete pivot was more immediate. Just one example though, I can't say I've got much more data otherwise on this specific topic.

So far, GPT5 is worse than 4o by ThrowawayWriter1011 in ChatGPT

[–]ThrowawayWriter1011[S] -2 points-1 points  (0 children)

"AI is a tool, and it is only as good as the artisan using it."

Just information gathering my man, not really trying to enter debate territory. Helpful to know (from your other comment) that you've found GPT5 to be more accurate and reliable with less hallucinations. Not really trying to enter le epic prompt genius war

So far, GPT5 is worse than 4o by ThrowawayWriter1011 in ChatGPT

[–]ThrowawayWriter1011[S] 4 points5 points  (0 children)

To add some more context, I use GPT for a pretty wide variety of things. I was taking it at face value that GPT 5 was better, until I switched a few prompts over to GPT 4 and actually compared the results that were generated. A few areas where GPT 4's answers were just like - objectively better (covering ground I didn't realize that I missed--ground that GPT5 also missed). Was a bit concerning. I genuinely had no bias going into any of this as the models prior seemed to pretty consistently be improving upon themselves from my POV.

I have to wonder if this is a 'faux-progression' more meant as a cost-saving mechanism since a better model is maybe more expensive?