x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Euan Ong — AI Alignment Forum
Euan Ong
https://ong.ac
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
73
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
1mo
0
29
Building and evaluating alignment auditing agents
6mo
0
82
Auditing language models for hidden objectives
11mo
3
25
Image Hijacks: Adversarial Images can Control Generative Models at Runtime
2y
1
Comments