AI ALIGNMENT FORUM
Meg
Posts
39 | Towards Understanding Sycophancy in Language Models | 1mo | 0
56 | Paper: LLMs trained on “A is B” fail to learn “B is A” | 2mo | 0
43 | Paper: On measuring situational awareness in LLMs | 3mo | 13