AI ALIGNMENT FORUM
Subagents
• Applied to "The hostile telepaths problem" by Kaj Sotala, 12d ago
• Applied to "Resolving von Neumann-Morgenstern Inconsistent Preferences" by niplav, 18d ago
• Applied to "Species as Canonical Referents of Super-Organisms" by Yudhister Kumar, 22d ago
• Applied to "Indecision and internalized authority figures" by Kaj Sotala, 4mo ago
• Applied to "Should rationalists be spiritual / Spirituality as overcoming delusion" by Kaj Sotala, 8mo ago
• Applied to "Quick thoughts on the implications of multi-agent views of mind on AI takeover" by Kaj Sotala, 11mo ago
• Applied to "Game Theory without Argmax [Part 2]" by Cleo Nardo, 1y ago
• Applied to "Game Theory without Argmax [Part 1]" by Cleo Nardo, 1y ago
• Applied to "One: a story" by Kaj Sotala, 1y ago
• Applied to "Wildfire of strategicness" by DanielFilan, 1y ago
• Applied to "Resolving internal conflicts requires listening to what parts want" by Kaj Sotala, 1y ago
• Applied to "A Clearer Thinking tool that teaches you to use Internal Family Systems concepts" by spencerg, 2y ago
• Applied to "Goodhart's Law inside the human mind" by Kaj Sotala, 2y ago
• Applied to "The self-unalignment problem" by Kaj Sotala, 2y ago
• Applied to "Remarks 1–18 on GPT (compressed)" by Cleo Nardo, 2y ago
• Applied to "Slack matters more than any outcome" by Kaj Sotala, 2y ago
• Applied to "Prosaic misalignment from the Solomonoff Predictor" by Cleo Nardo, 2y ago
• Applied to "Internal communication framework" by Kaj Sotala, 2y ago