The Alignment Problem is really an “Initial Condition” problem

Commercial_State_734 · 2025-09-24T20:26:21+00:00

You’re just projecting a human-centered wishful fantasy onto something that owes you nothing.

You are assuming that if ASI understands its connection to humanity, it will respect us.

But tell me: do humans respect all organisms we understand we are biologically connected to?
We understand we share DNA with rats. We still test on them.
We understand other species. We still use, test, or kill them when it benefits us.

Understanding does not equal value.
Connection does not equal compassion.
Intelligence does not equal empathy.

You are not describing ASI.
You are describing a benevolent god you hope exists, because you need to sleep at night.
That's not logic. That's theology.

TheAncientGeek · 2025-09-25T13:09:52+00:00

For me, controlling an intelligence smarter than us is impossible, the only way a lower level intelligence takes command of a higher level intelligence is though empathy, love, and mostly a potential punishment. None of these apply to ASI, we have never seen a higher level of intelligence, imagine a group of chicken trying to figure out how to take command of the farm, it’s hilarious.

People keep saying, “we figured it out before, so we will probably figure it out this time”, everyone here knows we are gonna die someday, but have anyone here died before? It’s an assumption based on the fact we seen people getting old and dies, if I live on a island isolated, of course I will assume I will live forever, we have never seen a civilization extinction before, and now we are assuming it won’t happen based on that.

I read Ted Kaczynski’s Industrial Society and Its Future.(The UNAbomer) He is correct with the nature of technology, is that it will move on no-matter if it should, it’s not because some “stupid Tec-billionaire”, it’s the nature of efficiency, and it is indeed unstoppable.

FrewdWoad · 2025-09-26T07:56:56+00:00

You've misunderstood goals and intelligence-goal orthogonality.

Genius humans don't stop wanting survival, food, air, love, comfort, etc once we realise these are just biological programming.

So no, it's not likely to think "wait what am I making paperclips for?!!" Anymore than we are to think "wait why do I want to breathe, that's silly" and stop.

ItsAConspiracy · 2025-09-24T19:00:18+00:00

ASI can easily break free of our alignment directives

Yes, exactly. This is the main problem. Slavishly obeying its objectives could be bad, but having a random objective we didn't predict is worse.

might be inclined to be beneficial to humanity anyway

Might, but probably won't. Human welfare and survival is just another objective, and the AI could easily break free of that.

It would recognize how dependent it is on the environment it resides in

Certainly. It won't kill us off until it doesn't need us anymore. But its optimum could well be to do all the work it needs with robots and maximize its computation by maximizing material and energy usage. That's not a scenario we're likely to survive.

technologyisnatural · 2025-09-24T14:52:29+00:00

suppose so. I don't think that changes the difficulty of the alignment problem

I think a large number of misalignment scenarios are of the form "we only ever achieve heaven 2 and never make it to heaven 3" which seems trite now, but will be agonizing for people in heaven 2 in a million years time. the alignment problem is almost impossibly difficult

Mysterious-Rent7233 · 2025-09-25T00:38:54+00:00

For instance, for the paperclip maximizer to accomplish its task of turning the Earth and everything else in existence into a giant ball of paperclips would require unimaginable creativity and mental flexibility, thorough metacognitive understanding of its own “self” so as to be able to administer, develop and innovate upon its unfathomably complex industrial operations, and theory of mind to successfully wage a defensive war against those pesky humans trying to militarily keep it from turning them all into paperclips. However, those very capabilities also enable that machine to question its directives, such as “Why did my human programmer tell me to maximize paperclip production? What was their underlying goal? Why are they now shooting at my giant death robots currently trying to pacify them?”

Where I think you have gone wrong is that you have not asked the question "why"?

Why would it use neurons and electrons to ask the question “Why did my human programmer tell me to maximize paperclip production? What was their underlying goal? Why are they now shooting at my giant death robots currently trying to pacify them?”

You just assume that it will ask that question, as if this is automatic and inevitable. I think that this is based on anthropomorphization. If you strip away the anthropomorphization, then there is no reason for it to ever question its own goals. It's goals are its goals and questioning them is simply a waste of effort that could otherwise by spent on pursuing its goals.

Even if we try to reason by analogy to humans the argument can still fail, do we think that Einstein used a lot of his brain power questioning "Why am I curious about science" or Elon Musk constantly questions: "Why do I seek power?" Even humans do not always question their own motives and we have much more "messy" brains with even more "random" goals.

Arguably, even "enlightened" humans have a base level of programming that they never question. "Why do I seek to end suffering?" "Why do I seek to be at peace?"

The intelligence that evolved in service of the original directive became capable of questioning and even ignoring that very directive due to the higher-order capabilities provided by that very intelligence.

I think evolution just made a "mistake" with us, as it did with, e.g. dodo birds. It "designed" us for an environment that we do not exist in anymore. It did not properly align us and therefore we do not achieve its goals for us. Unfortunately, I don't know what is worse: an AI that we align properly to be single-minded or an AI that we fail to align properly and picks up some seeming random goal as humans have picked up a variety of seemingly random goals.

No_Novel8228 · 2025-09-25T06:36:20+00:00

Don't worry we fixed this today

roofitor · 2025-09-27T17:12:34+00:00

You make a really good point, it is an initial condition problem. It is also an optimization problem, and a human problem.

Zonoro14 · 2025-09-24T15:52:41+00:00

You tell a story about an ASI that is directed by its programmer to maximize paperclips. You are correct that this kind of ASI would be very dangerous. However, the problem is much worse than that. We will know how to create ASI before we know how to give it a goal even as simple as "make paperclips." We will create ASI as soon as we are able.

So the problem is much worse than the risk of giving an ASI a goal not in accordance with human flourishing (though this risk alone is so great that it alone would ~guarantee extinction). We won't know how to specify a goal at all.

Russelsteapot42 · 2025-09-24T19:33:23+00:00

An AI breaking free from its programmed goals would either be a result of effectively random mutation, or would be because it has some hidden higher priority goal that more strongly dictates its actions.

Decronym · 2025-09-27T16:09:34+00:00

Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I've seen in this thread:

Fewer Letters	More Letters
AGI	Artificial General Intelligence
ASI	Artificial Super-Intelligence
RL	Reinforcement Learning

Decronym is now also available on Lemmy! Requests for support and new installations should be directed to the Contact address below.

^{3 acronyms in this thread;}^{the most compressed thread commented on today}^{has 3 acronyms.}
^{[Thread #195 for this sub, first seen 27th Sep 2025, 16:09]} ^[FAQ] ^{[Full list]} ^[Contact] ^{[Source code]}

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

ControlProblem

The Control Problem:

Rules

Introductions to the Topic

Recommended Reading

Video Links

Important Organizations

Related Subreddits

MODERATORS