AI ALIGNMENT FORUM

Experiments

This page is a stub.
Posts tagged Experiments
Inference-Only Debate Experiments Using Math Problems
Arjun Panickssery, Abhimanyu Pallavi Sudhir, JacksonKaunismaa
15 karma, 1y, 0 comments

Ten experiments in modularity, which we'd like you to run!
CallumMcDougall, Lucius Bushnaq, Avery
16 karma, 3y, 0 comments

Smoke without fire is scary
Adam Jermyn
23 karma, 3y, 9 comments

The need for multi-agent experiments
Martín Soto
17 karma, 1y, 0 comments

Early Experiments in Human Auditing for AI Control
Joey Yudelson, Buck Shlegeris
16 karma, 6mo, 0 comments

Simple experiments with deceptive alignment
Andreas_Moe
4 karma, 2y, 0 comments