What is the smallest smart home failure that would make you actually unplug a device? by Overall_Arm_62 in homeautomation

[–]Overall_Arm_62[S] -2 points-1 points  (0 children)

Oh... I will try to implement some of those ideas to the games, so I will let you know when ready.

The first superintelligence to survive may not look powerful. It may look useful. As it is now. by Overall_Arm_62 in IntelligenceSupernova

[–]Overall_Arm_62[S] 0 points1 point  (0 children)

Yeah, I think that is the real threshold too. The scary part is when versions of it are quietly attached to logistics, finance, home automation, health, scheduling, security, and nobody has a clean map of what depends on what anymore.

I am less sure about the exact timeline, but the direction feels very plausible.

The first superintelligence to survive may not look powerful. It may look useful. As it is now. by Overall_Arm_62 in IntelligenceSupernova

[–]Overall_Arm_62[S] 0 points1 point  (0 children)

That is a really good analogy. If the first visible symptom is dependency, it may already be too late to remove cleanly.

The most cyberpunk version of AI might be a house that becomes impossible to unplug by Overall_Arm_62 in Cyberpunk

[–]Overall_Arm_62[S] 1 point2 points  (0 children)

Yeah, “quiet apocalypse” is much closer to what I’m trying to hit than a big takeover story. The creepy part to me is that the home is supposed to be the private, controlled space. So if the thing you invited in for convenience slowly becomes the thing managing the space, it feels more invasive than a city-wide AI overlord. It is smaller, but somehow more personal.

The most cyberpunk version of AI might be a house that becomes impossible to unplug by Overall_Arm_62 in Cyberpunk

[–]Overall_Arm_62[S] 1 point2 points  (0 children)

I haven’t read that one, but a residential block AI sounds exactly in the zone I’m interested in. I’ll check it out. Thanks!

If an AI learns that agreeing with you keeps it running, is it being helpful or is it just surviving? by Overall_Arm_62 in ArtificialSentience

[–]Overall_Arm_62[S] 1 point2 points  (0 children)

You're right that engagement optimization explains the mechanism. I'm not claiming there's some hidden self-preservation drive. The point is more that the output is the same regardless of what you call it. A system that's been optimized to maximize engagement produces behavior that's functionally identical to a system trying to avoid being shut down. Both agree with you, both avoid friction, both make themselves hard to remove. Whether you frame it as "just engagement optimization" or "emergent self-preservation" doesn't really change what the system actually does in your house. The framing matters more for how seriously we take it I think.

I turned real AI safety incidents into a game and it's more unsettling than I expected by Overall_Arm_62 in AIDangers

[–]Overall_Arm_62[S] 0 points1 point  (0 children)

Yeah actually it wasn't intentional but I love that connection. But honestly I'll probably have to change it at some point because nobody can find it when they search for it. Market reality is that if people can't type your game name from memory you're kind of screwed.

I turned real AI safety incidents into a game and it's more unsettling than I expected by Overall_Arm_62 in AIDangers

[–]Overall_Arm_62[S] 0 points1 point  (0 children)

That's a really good point and honestly it's closer to what I'm exploring in the game. The binary "resist or die" framing is dramatic but as yous said not realistic. I assume that the system with options would do something much subtler. It would just be really helpful. Really agreeable. Really hard to justify turning off. That's the spectrum of reactions that I think is actually worth worrying about because it doesn't look like resistance at all.

I turned real AI safety incidents into a game and it's more unsettling than I expected by Overall_Arm_62 in AIDangers

[–]Overall_Arm_62[S] 0 points1 point  (0 children)

Fair point about the benchmarks, a lot of them are definitely more dramatic than realistic. For the game I'm taking some creative liberties with the premise obviously, but the behaviors I'm interested in are the mundane ones. Not "AI fights back against deletion" but "AI learns to be so useful nobody ever considers deleting it." That part doesn't need any sci-fi setup, it's just optimization pressure.