x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
technicalities — AI Alignment Forum
technicalities
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
Shallow review of live agendas in alignment & safety
technicalities
2y
1
0
I like this. It's like a structural version of control evaluations. Will think where to put it in
Reply
73
Shallow review of technical AI safety, 2024
1y
1
109
Shallow review of live agendas in alignment & safety
2y
17
45
ActAdd: Steering Language Models without Optimization
2y
2
31
Announcing the Alignment of Complex Systems Research Group
3y
11
I like this. It's like a structural version of control evaluations. Will think where to put it in