AI ALIGNMENT FORUM
You are viewing revision 1.1.0, last edited by Multicore.
Newsletters are collected summaries of recent events, posts, and academic papers.
Posts tagged Newsletters (Most Relevant)
- QAPR 4: Inductive biases (Quintin Pope, 2y)
- [MLSN #8] Mechanistic interpretability, using law to inform AI alignment, scaling laws for proxy gaming (Dan H, ThomasW, 1y)
- Quintin's alignment papers roundup - week 1 (Quintin Pope, 2y)
- [AN #102]: Meta learning by GPT-3, and a list of full proposals for AI alignment (Rohin Shah, 4y)
- [AN #115]: AI safety research problems in the AI-GA framework (Rohin Shah, 4y)
- Quintin's alignment papers roundup - week 2 (Quintin Pope, 2y)
- QAPR 3: interpretability-guided training of neural nets (Quintin Pope, 2y)
- [AN #166]: Is it crazy to claim we're in the most important century? (Rohin Shah, 3y)
- [AN #173] Recent language model results from DeepMind (Rohin Shah, 2y)
- [MLSN #6]: Transparency survey, provable robustness, ML models that predict the future (Dan H, 2y)
- [AN #112]: Engineering a Safer World (Rohin Shah, 4y)
- [AN #170]: Analyzing the argument for risk from power-seeking AI (Rohin Shah, 2y)
- Alignment Newsletter #36 (Rohin Shah, 5y)
- [AN #167]: Concrete ML safety problems and their relevance to x-risk (Rohin Shah, 2y)
- [AN #145]: Our three year anniversary! (Rohin Shah, 3y)