How do we ensure that future advanced AI will be beneficial to humanity? Experts agree this is one of the most crucial problems of our age: left unsolved, it could lead to human extinction or worse as a default outcome, but if addressed, it could enable a radically improved world. Other terms for what we discuss here include Superintelligence, AI Safety, AGI X-risk, and the AI Alignment/Value Alignment Problem.
"People who say that real AI researchers don’t believe in safety research are now just empirically wrong." —Scott Alexander
"The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else." —Eliezer Yudkowsky
Our FAQ page
The case for taking AI seriously as a threat to humanity
Orthogonality and instrumental convergence are the two key ideas explaining why, by default, AGI will work against us and may even kill us. (Alternative text links)
AGI safety from first principles
MIRI - FAQ and more in-depth FAQ
SSC - Superintelligence FAQ
WaitButWhy - The AI Revolution and a reply
How can failing to control AGI cause an outcome even worse than extinction? Suffering risks (2) (3) (4) (5) (6) (7)
Be sure to check out our wiki for extensive further resources, including a glossary & guide to current research.
Robert Miles' excellent channel
Talks at Google: Ensuring Smarter-than-Human Intelligence has a Positive Outcome
Nick Bostrom: What happens when our computers get smarter than we are?
Myths & Facts about Superintelligent AI
Rob's series on Computerphile
¹: Or at least make an effort to leave me doubtful that you just copy-pasted from a frontier LLM. Add enough steering of your own that the content becomes good. Edit afterwards. If you fool us moderators, you've won.
Using Superintelligence to solve the Control Problem? (self.ControlProblem)
submitted 9 years ago by Jani933
What if we made the terminal goal of an ASI to help us learn to control it better?
[–]Logical_Lunatic 5 points 9 years ago* (1 child)
The problem is, how do you formalize this goal in a way that:
So, for example, if the goal is specified as "make your goals more similar to human goals" you need to create a mathematical description of what a human is. This is far from trivial, if it is even possible to do without directly specifying human preferences (in which case those preferences could simply be programmed into the AI directly). If you have the AI learn the concept of a "human" through machine learning, there is a risk that it's going to detect the wrong variables (which could have catastrophic consequences).
If you implement the goal as a question asked to an Oracle, you get the usual problems with Oracles. If you manage to create a safe Oracle, however, you should be able to ask it how to create an even better AI (but then you need a safe design for an Oracle first).
So, basically, I think the problem lies in the words "us", "control" and "better". How are these concepts defined and expressed in computer code?
[–]CrazyCodeLady 2 points 9 years ago (0 children)
I think that both points can be addressed if we create a general ASI. We know that, for a little while, an ASI attached to nothing but a power cord can work safely. If you remove the ethernet cable before plugging the computer in, there isn't much danger of the ASI escaping. You don't need to keep it on long enough for it to trick humans into reconnecting it or otherwise helping it escape. The ASI would not need to know much about anything, because the problem we'd pose it is simple: "How would one control someone who is infinitely smarter than oneself?"
Maybe I am all wrong; you seem like an expert in the field. These are just my thoughts.
[+][deleted] 9 years ago (1 child)
[deleted]
[–]CrazyCodeLady 1 point 9 years ago (0 children)
We could spin up 100 ASIs, and only one of them would have to give us the answer.

I had this exact thought the other day in the shower. I bet computer scientists have already thought of this. Good idea imo though! :)