AI ALIGNMENT FORUM
technicalities
Posts
99
Shallow review of live agendas in alignment & safety
4mo
17
45
ActAdd: Steering Language Models without Optimization
7mo
2
31
Announcing the Alignment of Complex Systems Research Group
2y
11
Wiki Contributions
Comments
Shallow review of live agendas in alignment & safety
technicalities
4mo
1
0
I like this. It's like a structural version of control evaluations. Will think about where to put it.