19 Misalignment and misuse: whose values are manifest?

by KatjaGrace

13th Nov 2020

Meteuphoric

2 min read

7

19

AI

Frontpage

New Comment

3 comments, sorted by

top scoring

Click to highlight new comments since: Today at 8:09 AM

[-]Rohin Shah5y70

misalignment means the bad outcomes were wanted by AI (and not by its human creators), and
accident means that the bad outcomes were not wanted by those in power but happened anyway due to error.

My impression was that accident just meant "the AI system's operator didn't want the bad thing to happen", so that it is a superset of misalignment.

Though I agree with the broader point that in realistic scenarios there is usually no single root cause to enable this sort of categorization.

Reply

[-]Donald Hobson5y40

I think that you have a 4th failure mode. Moloch.

Reply

[-]adamShimi5y10

Isn't it correct and useful to say that Bob used the misalignment in his favor? That doesn't sound exactly right, because that makes Bob looked like he gamed the system instead of dealing with a tradeoff. In that context, the misalignment let Bob attack the economy in a way that was useful for him.

misalignment means the bad outcomes were wanted by AI (and not by its human creators), and
accident means that the bad outcomes were not wanted by those in power but happened anyway due to error.

My own intuition for accident in this context is that the bad part of the outcome was irrelevant for the AI, so it didn't try to push in that direction, but it's other actions did accidentally.

Reply

Moderation Log

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

19

Misalignment and misuse: whose values are manifest?

19