AI ALIGNMENT FORUM
AF

55
technicalities
Ω231300
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Shallow review of live agendas in alignment & safety
technicalities2y10

I like this. It's like a structural version of control evaluations. Will think where to put it in

Reply
73Shallow review of technical AI safety, 2024
11mo
1
108Shallow review of live agendas in alignment & safety
2y
17
45ActAdd: Steering Language Models without Optimization
2y
2
31Announcing the Alignment of Complex Systems Research Group
3y
11