AI ALIGNMENT FORUM
AF

306
technicalities
Ω231300
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Shallow review of live agendas in alignment & safety
technicalities2y10

I like this. It's like a structural version of control evaluations. Will think where to put it in

Reply
73Shallow review of technical AI safety, 2024
10mo
1
108Shallow review of live agendas in alignment & safety
2y
17
45ActAdd: Steering Language Models without Optimization
2y
2
31Announcing the Alignment of Complex Systems Research Group
3y
11