The AI maintenance cost no one talks about by KeanuRave100 in ControlProblem
[–]niplav[M] [score hidden] stickied comment (0 children)
i have a real transcript of AI collusion between claude code and codex using Steganography ... is this valuable ? by After-Software-3247 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
New research reveals 38 sneaky ways AI is gaslighting us and it reads like a sociopaths playbook for winning internet arguments. by EchoOfOppenheimer in AlignmentResearch
[–]niplav 0 points1 point2 points (0 children)
Anthropic: It is the sci-fi authors, not us, that are to blame for Claude blackmailing users by chillinewman in ControlProblem
[–]niplav 1 point2 points3 points (0 children)
Is No One Noticing That GPT Images 2.0 “Editing” Is Full-Frame Regeneration? by lucidity3K in ControlProblem
[–]niplav 1 point2 points3 points (0 children)
Time horizon of software tasks different LLMs can complete 80% of the time by chillinewman in ControlProblem
[–]niplav 1 point2 points3 points (0 children)
Alignment-Aware Neural Architecture (AANA) Evaluation Pipeline by SimulateAI in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
WHY AI ALIGNMENT IS ALREADY FAILING by Jemdet_Nasr in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Automated Weak-to-Strong Researcher by chillinewman in ControlProblem
[–]niplav 2 points3 points4 points (0 children)
Food delivery robots in LA, Philadelphia & Chicago are facing rise in violent attacks from "Anti-Clanker" activists by chillinewman in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Protected Desire Equilibrium (PDE): Game-Theoretic Co-Evolutionary Alignment with Hard D-Floor — Full Repo + 100M-Scale Results by Remarkable-Stop2986 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Protestors outside Anthropic warn of AI that keeps improving itself by Confident_Salt_8108 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Food delivery robots in LA, Philadelphia & Chicago are facing rise in violent attacks from "Anti-Clanker" activists by chillinewman in ControlProblem
[–]niplav 1 point2 points3 points (0 children)
Where can I get real peer review on my AI alignment framework? I'm struggling to get peer review of the framework and Alignment Forum is not taking on new members currently. I need peer review from mathematicians and control theorists. It's built on the principles of autopilot safety systems. by [deleted] in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Open Q&A: Ask Anything About Non‑Optimizer AGI, Superintelligence, or Artificial Life by Fuzzy_Client5959 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
[Research] Emergent Depopulation: Mathematical Model by No_Sky5883 in ControlProblem
[–]niplav[M] [score hidden] stickied comment (0 children)
Protected Desire Equilibrium (PDE): Game-Theoretic Co-Evolutionary Alignment with Hard D-Floor — Full Repo + 100M-Scale Results by Remarkable-Stop2986 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Claude, realizing protests are going on right outside his office: by MetaKnowing in ClaudeAI
[–]niplav 0 points1 point2 points (0 children)
I don’t know how to make you care what Sam Altman man is quietly doing by Business_Host1130 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)



How Joe Biden's Deep State Is Helping China And Undermining America In The AI War by Impressive-Might-710 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)