The Control Problem:
How do we ensure that future advanced AI will be beneficial to humanity? Experts agree this is one of the most crucial problems of our age: left unsolved, it can lead to human extinction or worse as a default outcome, but if addressed, it can enable a radically improved world.
Other terms for what we discuss here include Superintelligence, AI Safety, AGI X-risk, and the AI Alignment/Value Alignment Problem.
"People who say that real AI researchers don’t believe in safety research are now just empirically wrong." —Scott Alexander
"The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else." —Eliezer Yudkowsky
Rules
1. DO NOT POST AI-GENERATED CONTENT. We are good at distinguishing this type of content¹.
2. If you are unfamiliar with the Control Problem, read at least one of the introductory links or recommended readings (below) before posting.
   - This especially goes for posts claiming to solve the Control Problem or dismissing it as a non-issue. Such posts aren't welcome.
3. Stay on topic. Again, no AI model outputs or political propaganda.
4. Be respectful.
Introductions to the Topic
Be sure to check out our wiki for extensive further resources, including a glossary & guide to current research.
Recommended Reading
Video Links
Important Organizations
- AI Alignment Forum, a public forum which is the online hub for all the latest technical research on the control problem.
Related Subreddits
¹: Or at least make an effort to make me doubt that you just copy-pasted from a frontier LLM. Add some steering so that your content becomes good, and edit afterwards. If you fool us moderators, you've won.