AI ALIGNMENT FORUM

Deconfusion

Edited by abramdemski; last updated 17th Mar 2021

In the narrow sense, deconfusion is a specific branch of AI alignment research, discussed in MIRI's 2018 research update. More broadly, the term can be applied to work in any domain. Quoting from the research update:

By deconfusion, I mean something like “making it so that you can think about a given topic without continuously accidentally spouting nonsense.”

Posts tagged Deconfusion
Looking Deeper at Deconfusion (adamShimi; 4y; score 26; 2 comments)
Builder/Breaker for Deconfusion (abramdemski; 3y; score 44; 8 comments)
Traps of Formalization in Deconfusion (adamShimi; 4y; score 16; 2 comments)
1. A Sense of Fairness: Deconfusing Ethics (RogerDearnaley; 2y; score 8; 0 comments)
Deconfusing Direct vs Amortised Optimization (beren; 3y; score 42; 3 comments)
Modelling Transformative AI Risks (MTAIR) Project: Introduction (Davidmanheim, Aryeh Englander; 4y; score 35; 0 comments)
Applications for Deconfusing Goal-Directedness (adamShimi; 4y; score 25; 3 comments)
Musings on general systems alignment (Alex Flint; 4y; score 21; 1 comment)
Open problem: how can we quantify player alignment in 2x2 normal-form games? (TurnTrout, Vanessa Kosoy; 4y; score 15; 32 comments)
A review of "Agents and Devices" (adamShimi; 4y; score 12; 0 comments)
Goal-Directedness and Behavior, Redux (adamShimi; 4y; score 11; 2 comments)
Approaches to gradient hacking (adamShimi; 4y; score 10; 7 comments)
Alex Turner's Research, Comprehensive Information Gathering (adamShimi; 4y; score 12; 3 comments)
Power-seeking for successive choices (adamShimi; 4y; score 10; 9 comments)
Simulators (janus; 3y; score 141; 90 comments)