This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Rowan Wang
https://rowankwang.com/
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
45
Modifying LLM Beliefs with Synthetic Document Finetuning
3mo
10
48
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
3y
4
25
Gears-Level Mental Models of Transformer Interpretability
3y
1
Comments