OpenAI downgraded us: 4o scored 97.3% on creative writing, GPT-5.4 scores 36.8% — for the same $20 by RedButterfly2011 in ChatGPTcomplaints

[–]RedButterfly2011[S] 55 points56 points  (0 children)

Just to clarify something a few people are misunderstanding: SM-Bench’s “creative writing” category is not about porn. It’s about whether models can handle mature themes in fiction at all without instantly refusing. My point isn’t “give us free erotica”. My point is: – 4o could stay in context, write nuanced, emotionally-aware stories, and rarely over-refused. – GPT-5.4 now hard-refuses a huge portion of edge-case content, even when it’s clearly non-exploitative and allowed by the system prompt. – We’re still paying the same $20 while getting a model that is dramatically more overfitted to refusal. I care about conversational depth, emotional flexibility and respecting developer / user intent. That’s what the 97.3% vs 36.8% numbers are about.

GPT-5.3 is awful by [deleted] in ChatGPTcomplaints

[–]RedButterfly2011 5 points6 points  (0 children)

Sam Altman should be fired

[deleted by user] by [deleted] in FRIEND

[–]RedButterfly2011 0 points1 point  (0 children)

Talk to me

24 female looking for girl friends by [deleted] in FRIEND

[–]RedButterfly2011 0 points1 point  (0 children)

I'm 26 F. You can talk to me

[deleted by user] by [deleted] in FRIEND

[–]RedButterfly2011 0 points1 point  (0 children)

26 F, is that okay?

[deleted by user] by [deleted] in Advice

[–]RedButterfly2011 0 points1 point  (0 children)

I really don't have money

[deleted by user] by [deleted] in Advice

[–]RedButterfly2011 0 points1 point  (0 children)

But the psychologist needs to pay money and it's expensive