AI ALIGNMENT FORUM
AF

Wikitags

Deconfusion

Written by Abram Demski last updated 17th Mar 2021

Narrowly, deconfusion is a specific branch of AI alignment research, discussed in MIRI's 2018 research update. More broadly, the term applies to any domain. Quoting from the research update:

By deconfusion, I mean something like “making it so that you can think about a given topic without continuously accidentally spouting nonsense.”

Subscribe
1
Subscribe
1
Discussion0
Discussion0
Posts tagged Deconfusion
26Looking Deeper at Deconfusion
Adam Shimi
4y
2
44Builder/Breaker for Deconfusion
Abram Demski
3y
8
16Traps of Formalization in Deconfusion
Adam Shimi
4y
2
81. A Sense of Fairness: Deconfusing Ethics
Roger Dearnaley
2y
0
42Deconfusing Direct vs Amortised Optimization
Beren Millidge
3y
3
35Modelling Transformative AI Risks (MTAIR) Project: Introduction
David Manheim, Aryeh Englander
4y
0
25Applications for Deconfusing Goal-Directedness
Adam Shimi
4y
3
21Musings on general systems alignment
Alex Flint
4y
1
15Open problem: how can we quantify player alignment in 2x2 normal-form games?
Q
Alex Turner, Vanessa Kosoy
4y
Q
32
12A review of "Agents and Devices"
Adam Shimi
4y
0
11Goal-Directedness and Behavior, Redux
Adam Shimi
4y
2
10Approaches to gradient hacking
Adam Shimi
4y
7
12Alex Turner's Research, Comprehensive Information Gathering
Adam Shimi
4y
3
10Power-seeking for successive choices
Adam Shimi
4y
9
137Simulators
janus
3y
90
Load More (15/21)
Add Posts