AI Alignment Forum: Tags

Exercises / Problem-Sets

Contributors: jacobjacob (2), Yoav Ravid (2)

This tag collects posts with concrete exercises: problems that have solutions (or at least some clear feedback loop), and things that you can attempt yourself in order to learn and grow.

Related Pages: Games (posts describing)

Posts tagged Exercises / Problem-Sets
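Reframing Impact (Alex Turner; 4y): karma 29, comments 4, tag relevance 2
Topological Fixed Point Exercises (Scott Garrabrant, Sam Eisenstat; 5y): karma 23, comments 40, tag relevance 2
Alignment research exercises (Richard Ngo; 2y): karma 57, comments 7, tag relevance 2
Deducing Impact (Alex Turner; 4y): karma 19, comments 18, tag relevance 1
Value Impact (Alex Turner; 4y): karma 22, comments 3, tag relevance 1
World State is the Wrong Abstraction for Impact (Alex Turner; 4y): karma 22, comments 6, tag relevance 1
Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo (Neel Nanda; 5mo): karma 27, comments 2, tag relevance 1
Deep learning curriculum for large language model alignment (Jacob Hilton; 1y): karma 17, comments 3, tag relevance 1
Infra-Exercises, Part 1 (Diffractor, Jack Parker, Connall Garrod; 1y): karma 17, comments 1, tag relevance 1
Attainable Utility Landscape: How The World Is Changed (Alex Turner; 4y): karma 22, comments 5, tag relevance 1
Game Theory without Argmax [Part 1] (Cleo Nardo; 22d): karma 16, comments 1, tag relevance 1
Intro to Superposition & Sparse Autoencoders (Colab exercises) (CallumMcDougall; 4d): karma 21, comments 0, tag relevance 1
Mech Interp Puzzle 2: Word2Vec Style Embeddings (Neel Nanda; 4mo): karma 17, comments 2, tag relevance 1
Fixed Point Exercises (Scott Garrabrant; 5y): karma 15, comments 0, tag relevance 0
An Exercise to Build Intuitions on AGI Risk (Lauro Langosco; 6mo): karma 23, comments 3, tag relevance 1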