AI ALIGNMENT FORUM
Peter Barnett
EA and AI safety
https://peterbarnett.org/
Posts
0 | peterbarnett's Shortform | 1y | 0
6 | Doing oversight from the very start of training seems hard | 4mo | 1
13 | Framings of Deceptive Alignment | 9mo | 0
20 | Understanding Gradient Hacking | 1y | 2