This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Auditing Games
•
Applied to
Hidden Cognition Detection Methods and Benchmarks
by
Paul Colognese
1mo
ago
•
Applied to
Automating Auditing: An ambitious concrete technical research proposal
by
Raymond Arnold
1y
ago
•
Applied to
A transparency and interpretability tech tree
by
Raymond Arnold
1y
ago
•
Applied to
Auditing games for high-level interpretability
by
Ruben Bloom
1y
ago
•
Created by
Ruben Bloom
at
1y