Anthony DiGiovanni

When is intent alignment sufficient or necessary to reduce AGI conflict?

by JesseClifton, Sammy Martin, and Anthony DiGiovanni

In this post, we look at the conditions under which intent alignment isn't sufficient, or isn't necessary, for interventions on AGI systems to be effective at reducing the risks of (unendorsed) conflict. We then conclude this sequence by listing what we currently think are relatively promising directions for technical...

Sep 14, 2022

Responses to apparent rationalist confusions about game / decision theory

When does technical work to reduce AGI conflict make a difference?: Introduction

When would AGIs engage in conflict?

When is intent alignment sufficient or necessary to reduce AGI conflict?
