x

AI ALIGNMENT FORUM

AF

Research Agendas — AI Alignment Forum

Research Agendas

Edited by plex last updated 16th Sep 2021

Research Agendas lay out the areas of research which individuals or groups are working on, or those that they believe would be valuable for others to work on. They help make research more legible and encourage discussion of priorities.

Add Posts

1

1

Posts tagged Research Agendas

5

46Embedded Agents

abramdemski, Scott Garrabrant

7y

7

7

38The Learning-Theoretic AI Alignment Research Agenda

8y

37

5

12New safety research agenda: scalable agent alignment via reward modeling

7y

12

1

98On how various plans miss the hard bits of the alignment challenge

4y

48

4

44Paul's research agenda FAQ

8y

34

4

21Research Agenda v0.9: Synthesising a human's preferences into a utility function

Stuart_Armstrong

7y

18

3

36Our take on CHAI’s research agenda in under 1500 words

6y

14

1

72An overview of 11 proposals for building safe advanced AI

6y

32

5

55Shallow review of technical AI safety, 2025

technicalities, Tomáš Gavenčiak, Stephen McAleese, peligrietzer, Stag, jordinne, ozziegooen, Violet Hour, lenz

5mo

0

2

54Embedded Agency (full-text version)

Scott Garrabrant, abramdemski

7y

4

3

19Trying to isolate objectives: approaches toward high-level interpretability

3y

5

2

71Some conceptual alignment research projects

4y

6

1

52Davidad's Bold Plan for Alignment: An In-Depth Explanation

Charbel-Raphaël, Gabin

3y

5

2

54The Learning-Theoretic Agenda: Status 2023

3y

5

2

51Thoughts on Human Models

Ramana Kumar, Scott Garrabrant

7y

9

Load More (15/111)

Add Posts