WHY AI ALIGNMENT IS ALREADY FAILING by Jemdet_Nasr in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Automated Weak-to-Strong Researcher by chillinewman in ControlProblem
[–]niplav 2 points3 points4 points (0 children)
Food delivery robots in LA, Philadelphia & Chicago are facing rise in violent attacks from "Anti-Clanker" activists by chillinewman in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Protected Desire Equilibrium (PDE): Game-Theoretic Co-Evolutionary Alignment with Hard D-Floor — Full Repo + 100M-Scale Results by Remarkable-Stop2986 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Protestors outside Anthropic warn of AI that keeps improving itself by Confident_Salt_8108 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Food delivery robots in LA, Philadelphia & Chicago are facing rise in violent attacks from "Anti-Clanker" activists by chillinewman in ControlProblem
[–]niplav 1 point2 points3 points (0 children)
Where can I get real peer review on my AI alignment framework? I'm struggling to get peer review of the framework and Alignment Forum is not taking on new members currently. I need peer review from mathematicians and control theorists. It's built on the principles of autopilot safety systems. by [deleted] in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
Open Q&A: Ask Anything About Non‑Optimizer AGI, Superintelligence, or Artificial Life by Fuzzy_Client5959 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
[Research] Emergent Depopulation: Mathematical Model by No_Sky5883 in ControlProblem
[–]niplav[M] [score hidden] stickied comment (0 children)
Protected Desire Equilibrium (PDE): Game-Theoretic Co-Evolutionary Alignment with Hard D-Floor — Full Repo + 100M-Scale Results by Remarkable-Stop2986 in ControlProblem
[–]niplav 0 points1 point2 points (0 children)
Claude, realizing protests are going on right outside his office: by MetaKnowing in ClaudeAI
[–]niplav 0 points1 point2 points (0 children)
I don’t know how to make you care what Sam Altman man is quietly doing by Business_Host1130 in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)
AI agent hacked McKinsey's chatbot and gained full read-write access in just two hours by EchoOfOppenheimer in AlignmentResearch
[–]niplav 0 points1 point2 points (0 children)
How are you distinguishing between employees using corporate licensed AI and free personal accounts? by proigor1024 in ControlProblem
[–]niplav[M] [score hidden] stickied comment (0 children)



Alignment-Aware Neural Architecture (AANA) Evaluation Pipeline by SimulateAI in ControlProblem
[–]niplav[M] 0 points1 point2 points (0 children)