Deconfusion

Edited by abramdemski last updated 17th Mar 2021

Narrowly, deconfusion is a specific branch of AI alignment research, discussed in MIRI's 2018 research update. More broadly, the term applies to any domain. Quoting from the research update:

By deconfusion, I mean something like “making it so that you can think about a given topic without continuously accidentally spouting nonsense.”

Posts tagged Deconfusion

12

26Looking Deeper at Deconfusion

adamShimi

5y

2

8

44Builder/Breaker for Deconfusion

abramdemski

4y

8

16Traps of Formalization in Deconfusion

adamShimi

5y

2

81. A Sense of Fairness: Deconfusing Ethics

RogerDearnaley

3y

0

1

42Deconfusing Direct vs Amortised Optimization

beren

3y

3

1

35Modelling Transformative AI Risks (MTAIR) Project: Introduction

Davidmanheim, Aryeh Englander

5y

0

1

25Applications for Deconfusing Goal-Directedness

adamShimi

5y

3

1

21Musings on general systems alignment

Alex Flint

5y

1

15Open problem: how can we quantify player alignment in 2x2 normal-form games?

Q

TurnTrout, Vanessa Kosoy

5y

Q

32

1

12A review of "Agents and Devices"

adamShimi

5y

0

1

11Goal-Directedness and Behavior, Redux

adamShimi

5y

2

1

10Approaches to gradient hacking

adamShimi

5y

7

1

12Alex Turner's Research, Comprehensive Information Gathering

adamShimi

5y

3

1

10Power-seeking for successive choices

adamShimi

5y

9

1

144Simulators

janus

4y

90