AI ALIGNMENT FORUM
Decision Theory
• Applied to Exploiting Newcomb's Game Show by carterallen 5d ago
• Applied to Decision Theory with the Magic Parts Highlighted by Ruben Bloom 13d ago
• Applied to Can we learn much by studying the behaviour of RL policies? by AidanGoth 14d ago
• Applied to Acausal trade naturally results in the Nash bargaining solution by Christopher King 21d ago
• Applied to Is EDT correct? Does "EDT" == "logical EDT" == "logical CDT"? by Vivek Hebbar 22d ago
• Applied to Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds by Jakub Kraus 1mo ago
• Applied to The Unexpected Clanging by Chris_Leong 1mo ago
• Applied to Alien Axiology by RobertM 1mo ago
• Applied to GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas. by Ruben Bloom 1mo ago
• Applied to "Do X because decision theory" ~= "Do X because bayes theorem" by L "Full Retard" C 1mo ago
• Applied to Strong Cheap Signals by trevor 2mo ago
• Applied to GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that's incorrect if they differ by Christopher King 2mo ago
• Applied to Payor's Lemma in Natural Language by RobertM 3mo ago
• Applied to Don't Jump or I'll... by Double 3mo ago
• Applied to Some Variants of Sleeping Beauty by Sylvester Kollin 3mo ago
• Applied to Heuristics on bias to action versus status quo? by Farkas 3mo ago
• Applied to Threatening to do the impossible: A solution to spurious counterfactuals for functional decision theory via proof theory by Christopher King 4mo ago
• Applied to Modal Fixpoint Cooperation without Löb's Theorem by Vladimir Nesov 4mo ago