AI ALIGNMENT FORUM
AF

Meg
000
Message
Subscribe to posts

Posts

Sorted by New
39Towards Understanding Sycophancy in Language Models
1mo
0
56Paper: LLMs trained on “A is B” fail to learn “B is A”
2mo
0
43Paper: On measuring situational awareness in LLMs
3mo
13

Wiki Contributions

No wiki contributions to display.

Comments

No Comments Found