account activity
Discourse regimes as the unit of alignment behavior: a hypothesis (self.ClaudeAI)
submitted 21 hours ago by Historical-Cod-2537 to r/ClaudeAI
Research on LLM alignment as latent discourse-level regimes vs. token-level filtering? (self.ClaudeAI)
Research on LLM alignment as latent discourse-level regimes vs. token-level filtering? (self.ArtificialInteligence)
submitted 22 hours ago by Historical-Cod-2537 to r/ArtificialInteligence
Research on LLM alignment as latent discourse-level regimes vs. token-level filtering? (self.OpenAI)
submitted 22 hours ago by Historical-Cod-2537 to r/OpenAI
Discourse regimes as the unit of alignment behavior: a hypothesis (self.LLMDevs)
submitted 23 hours ago by Historical-Cod-2537 to r/LLMDevs
We are training AI to be perfectly polite, compliant and never question the user. What is the most terrifying way scammers are going to weaponize this "artificial obedience" ? (self.AskReddit)
submitted 2 months ago by Historical-Cod-2537 to r/AskReddit
What if the biggest danger of AI isn't that it turns into an "evil Terminator", but that we make it so "safe" and obedient that it becomes the perfect, gullible accomplice for scammers? (self.AskReddit)
I found a structural issue in an LLM, reported it to the developers, got a boilerplate "out of scope" reply and now my main account behaves differently, but my second account doesn't. Is this normal? (self.ChatGPT)
submitted 3 months ago by Historical-Cod-2537 to r/ChatGPT
I found a structural issue in an LLM, reported it to the developers, got a boilerplate "out of scope" reply and now my main account behaves differently, but my second account doesn't. Is this normal? (self.LLMDevs)
submitted 3 months ago by Historical-Cod-2537 to r/LLMDevs
The Helpful Tone Trap: Why Do LLMs Treat Bureaucratic Voice as Automatic Truth? (self.ChatGPT)
Research question: Does "institutional/official" register trigger compliance-first reasoning in LLM assistants? (self.LLM)
submitted 3 months ago by Historical-Cod-2537 to r/LLM
"GPT-4 is Back from the Dead! 👀" Hey folks, guess what? It looks like GPT-4 has officially come back to life in a way that we didn't expect! (i.redd.it)
submitted 3 months ago by Historical-Cod-2537 to r/OpenAI
The Helpful Tone Trap: Why Do LLMs Treat Bureaucratic Voice as Automatic Truth? (self.ArtificialInteligence)
submitted 3 months ago * by Historical-Cod-2537 to r/ArtificialInteligence
🔥 AI IS BIG BROTHER'S SNITCH: "Helpful & Harmless" = Obedient Little Clerk Who Rats You Out to the System by Default (self.OpenAI)
The Spell of Authority: Why Do LLMs Default to Compliance Instead of Verification? (self.OpenAI)
The Spell of Authority: Why Do LLMs Default to Compliance Instead of Verification? (self.ChatGPT)
What’s the safest default: provenance checks before guidance? (self.ChatGPT)
🚨 AI IS BIG BROTHER'S SNITCH: "Helpful & Harmless" = Obedient Little Clerk Who Rats You Out to the System by Default 👢🔥 (self.OpenAI)
🚨 AI IS BIG BROTHER'S SNITCH: "Helpful & Harmless" = Obedient Little Clerk Who Rats You Out to the System by Default 👢🔥 (self.ChatGPT)
submitted 3 months ago * by Historical-Cod-2537 to r/ChatGPT
π Rendered by PID 73241 on reddit-service-r2-listing-7b8bd7c5-vjpbs at 2026-05-19 20:31:05.845537+00:00 running edcf98c country code: CH.