AI ALIGNMENT FORUMTags
AF

Academic Papers

EditHistorySubscribe
Discussion (0)
Help improve this page (3 flags)
EditHistorySubscribe
Discussion (0)
Help improve this page (3 flags)
Academic Papers
Random Tag
Contributors
2Kaj Sotala

Posts either linking to, or summarizing, formal papers published elsewhere.

Posts tagged Academic Papers
Most Relevant
1
61Some AI research areas and their relevance to existential safety
Andrew Critch
1y
37
2
482021 AI Alignment Literature Review and Charity Comparison
Larks
4mo
13
1
17Formal Solution to the Inner Alignment Problem
michaelcohen
1y
123
1
20Why is pseudo-alignment "worse" than other ways ML can fail to generalize?Q
nostalgebraist, Evan Hubinger
2y
Q
8
1
25How truthful is GPT-3? A benchmark for language models
Owain Evans
8mo
18
0
18Learning preferences by looking at the world
Rohin Shah
3y
4
0
26Human-AI Collaboration
Rohin Shah
3y
4
0
19Learning biases and rewards simultaneously
Rohin Shah
3y
3
0
11New paper: Corrigibility with Utility Preservation
Koen Holtman
3y
0
0
7Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jaime Sevilla, Pablo Antonio Moreno Casares
3y
3
0
10New paper: The Incentives that Shape Behaviour
Ryan Carey
2y
3
1
5Demanding and Designing Aligned Cognitive Architectures
Koen Holtman
5mo
5
Add Posts