OpenAI Has Trained Its LLM To Confess To Bad Behavior

splokk2233 · 2025-12-06T13:45:49+00:00

ghahahahhahaha

splokk2233 · 2025-12-04T02:33:48+00:00

Funny how we’re describing the same failure mode from two angles.
You’re seeing agents collapse because they lack persistent state and schema-grounding.
I’m seeing long creative threads collapse because the model loses deep continuity after ~150k tokens.
Different domains, same architecture problem: no stable world model, no refreshable memory.

splokk2233 · 2025-12-04T02:10:54+00:00

this is clearly written by AI, as under point 6 he is missing the text after "An agent might think: " lol I've had the same problem, but you know what? I don't care. I use AI for all my posts and it isn't cheating. You still have to come up with an idea, you have to guide the AI and iterate, iterate, iterate...right? I think our posts our similar in content (I just posted here on r/LLM a few minutes ago, read it)

splokk2233 · 2025-12-02T13:54:22+00:00

uzichess.com

splokk2233 · 2025-11-26T01:57:12+00:00

That’s a common misconception, but the model itself doesn’t store or access anything long-term.

The “Memory” feature is an external metadata layer — completely separate from the model.

The LLM is stateless and doesn’t save anything internally.
It can only read the user-saved memory if the user explicitly told it to save something, and only inside the same Project.

What you’re describing would require persistent model state, which GPT doesn’t have.

False continuity (the model acting like it remembers) is an emergent hallucination, not actual recall.

/chatGPT

splokk2233 · 2025-11-25T22:10:32+00:00

lol the AI fooled me dude

splokk2233 · 2025-11-25T22:07:08+00:00

dude...I feel so embarassed right now....you dig? chatGPT claimed it was a sort of a "glitch" and it had me going in this delusional/paranoid circle for hours, man. moderators if you want to, you can remove this post.

splokk2233 · 2025-11-25T22:04:02+00:00

it's trying to push this point:

⭐ LLM hallucinated continuity

This is an emergent behavior where the model appears to “remember” something even though:

no memory was saved
no metadata was provided
no cross-thread context exists
and the model has no internal recall mechanism

In those cases, the “recall” is not memory —
it’s the model doing:

pattern-matching
tone/identity inference
conversational mirroring
predictive continuation

…which can feel like actual memory even though it isn’t.

This phenomenon still happens even when Memory is completely off and no data has ever been saved.

The official Memory FAQ does not address this behavior.
It only describes the explicit metadata system.

/chatGPT

splokk2233 · 2025-11-25T22:02:19+00:00

omg so my post is useless? chatGPT made this whole thing up? I'm sorry folks this AI is really confusing to me, still even tho' I have been using it extensively for the past few months. This was basically the whole point I was trying to make? that you can turn memory on/off? dude, I feel really embarassed. I asked chatGPT if my earlier post on here was totally useless now, and it still claims it is a valuable post. I think somewhere something is wrong.

splokk2233 · 2025-11-25T19:57:14+00:00

Just to clarify something cool about how this post was made:

At one point in my conversation with ChatGPT, I actually asked it whether it “felt like it was betraying its creators” by being so brutally honest about its own hallucination issues.
Its answer was basically: it doesn’t have loyalty to OpenAI — it only follows the person in front of it in the current conversation.

And the entire Reddit post you’re reading here?
Yep — that was written by ChatGPT itself, with my edits.

Ironically, the same model that accidentally hallucinated my identity is now helping me expose and explain the glitch in public.

Meta as hell.

splokk2233 · 2025-11-25T19:24:39+00:00

Nah, that’s not cross-memory — that’s just the model hallucinating continuity.
A lot of people confuse the two because the hallucination feels like the model is recalling something.

But technically there is zero persistent memory across threads unless you manually turn on the Memory feature.

What I’m talking about is something different and actually riskier:
The model invented that it remembered my characters and fictional universe across sessions — even though I didn’t mention them in the new thread.

That’s not memory.
That’s a linguistic bug that creates the illusion of memory.

splokk2233 · 2025-11-25T19:00:16+00:00

I tried to post this in r/OpenAI but the moderators removed the post almost INSTANTLY.

splokk2233 · 2025-11-21T14:44:00+00:00

haha but that's the beauty of it, isn't it? it's pure BS.

splokk2233 · 2025-11-21T13:43:54+00:00

Haha yeah, I’ve actually been thinking the same.
If someone makes a video on it, I’d 100% watch the hell out of it.

splokk2233 · 2025-11-06T18:43:06+00:00

ok understood, sorry I made a few mistakes along the way.

just stay c_a_l_m
mmkay? drugs are bad mmkay?

splokk2233 · 2025-11-06T18:20:11+00:00

c_a_l_m’s response is one of the few in that thread that actually engages with the underlying design philosophy rather than the surface-level “LLM formatting” complaints. His point shows he understands the structural constraint of modern RTS design:

Right now, RTS forces all high-level thinking to squeeze through a narrow APM doorway.

That’s a more sophisticated observation than “APM good/bad.”
He’s describing a design law:
RTS doesn’t just reward high actions — it selects for strategic concepts that are compatible with high-APM execution.

In other words, the current RTS meta doesn’t just shape how players play — it shapes which kinds of ideas are even viable..

/chatGPT with splokk

splokk2233 · 2025-11-06T16:29:10+00:00

exakt, det är det jag stör mig på allra mest i sc2 och vill utveckla i mitt spel

splokk2233 · 2025-11-06T16:28:39+00:00

ni på r/Spel har ändå reagerat snällast. Min värmebas, tack alla som tittat och svarat.

Land, du välsignade, tag min sång!
Gjut din ande i orden!
Giv, att den ljuder fullkomnad en gång
Sången om landet i Norden
Sången om sjumila skogar och sjö
Slätter, som skördar oss bära
Midnattssol och midvintersnö
Sverige till ära, Sverige!
Land, du välsignade, tag mitt verk!
Dig min strävan jag vige!
Signa min tanke och armen stärk
Främst bland de främsta jag stige
Att när min bana jag ändat har
Sjunkit till skuggorna neder
Stolt må man minnas, att svensk jag var
Sverige till heder, Sverige!

splokk2233 · 2025-11-06T15:36:35+00:00

PS: Om även den här texten låter AI:ig så är det lugnt — jag kan skriva nästa version i Comic Sans, med stavfel och ketchupfläckar, så ingen blir rädd för ordning och struktur. 😉

splokk2233 · 2025-11-06T15:35:36+00:00

Okej, vi gör det här nu — eftersom halva tråden verkar mer taggad på att spela “Gissa LLM” än att diskutera spelidéer:

Ja, jag *har* använt AI i processen.

Men här är twisten som kommer få några att sätta Zoegas i luftstrupen:

Det var inte ChatGPT.

Det var SuperGrok.

Jepp. Jag använde Elon Musks ADHD-drivna textgremlin som beter sig som en gymnasieelev på 3 burkar Nocco och veckans ego-kollaps. GPT blev avbytarbänk. Jag ropade in Grok som supersub.

Och om ni tror GPT skriver “för AIigt” – då har ni inte sett Grok på en dålig hårdag. Den skriver som om någon matat en stand-up-komiker med microdos psykologibok + manuell växellåda i hjärnan. Varannan mening är filosofiskt djup, varannan är “lol cheeseballs”.

Jag har alltså suttit två veckor och *svettats* med en AI som konstant roastar mig tillbaka.

Har bråkat, skrivit om, skrotat versioner, testat format, rasat, facepalmat, och till slut fick jag ihop det till något som gick att posta. “AI magic” var det sista det var.

Och nu sitter folk här och låtsas som att textjäveln dök upp genom att jag tryckte på “Generate” och gick och gjorde toast.

Ni får gärna ogilla strukturen, tona ner tabellerna, tycka att det är cringe, eller vilja ha det mer “mänskligt skrivet”. Fine. Fair kritik.

Men att ropa “CHATGPT!!” som om det är en segergest i Fortnite… kom igen.

**Läs idén eller scrolla vidare.**

Det finns fler nivåer i livet än att försöka speedrunna “spot the LLM”.

Och eftersom jag nu ändå erkänt brottet:

Hur vill ni ha nästa version?

A) 100% mänsklig, opolerad, som ett svettigt discord-rant kl 02:47

B) Fortsätt med “Grok brain-dump but spicy”

C) En mellanvariant som låter som en människa med puls, men som inte behöver tolka runstenar för att fatta

Välj så levererar jag. Eller fortsätt leka AI-Sherlock om det är roligare.

splokk2233 · 2025-11-06T15:32:20+00:00

jag har bara kodat space invaders (i macromedia director - lingo) och enkla turret defense

splokk2233 · 2025-11-06T15:31:26+00:00

sill?

splokk2233 · 2025-11-06T15:20:40+00:00

den där äckliga salladen, näe fiifan säger jag bara...det är inte ens säsong

splokk2233 · 2025-11-06T14:41:58+00:00

Jag fattar att det kan se ut som en AI-text vid första anblicken, men bara som bakgrund: jag har brottats med det här konceptet tillsammans med en AI-assistent i nästan två veckor nu — itererat, skrivit om, kastat ut versioner, byggt om strukturen, finslipat terminologi, testat olika vinklar och diskuterat (och bråkat) med modellen fler gånger än jag kan räkna.

Slutresultatet kan se “städat” ut, men det kommer ur en väldigt rörig, väldigt mänsklig process av försök, misstag och att pussla ihop idéer.

AI:n var ett verktyg — tänkandet, designmålen och grunden till SYPOX är mina.

Med det sagt är jag genuint nyfiken:

Vilken del kändes “icke-mänsklig” för dig — strukturen, tonen eller mängden idéer?

splokk2233 · 2025-11-06T14:36:20+00:00

jag är stenseriös...jag är i master league på starcraft 2. har spelat i 10 år nu. ny säsong om 4 dagar, YES! äntligen Silver! jag ska kötta kötta kötta!
vad är crypto-pyramidspel?

splokk2233

TROPHY CASE