AI ALIGNMENT FORUM
technicalities
Posts
85
Shallow review of live agendas in alignment & safety
1d
16
44
ActAdd: Steering Language Models without Optimization
3mo
2
31
Announcing the Alignment of Complex Systems Research Group
2y
11
Comments
Shallow review of live agendas in alignment & safety
technicalities
7d
1
0
I like this. It's like a structural version of control evaluations. Will think about where to put it.