AI ALIGNMENT FORUM
AF

technicalities
Ω229300
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Shallow review of live agendas in alignment & safety
technicalities2y10

I like this. It's like a structural version of control evaluations. Will think where to put it in

Reply
71Shallow review of technical AI safety, 2024
6mo
1
108Shallow review of live agendas in alignment & safety
2y
17
45ActAdd: Steering Language Models without Optimization
2y
2
31Announcing the Alignment of Complex Systems Research Group
3y
11