AI ALIGNMENT FORUM
AF

Tim Hua
Ω100000
Message
Dialogue
Subscribe

Current MATS scholar working with Neel Nanda and Samuel Marks. Formerly an economist at Walmart. 

Email me at the email available on my website at timhua.me if you want to reach me!

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
No Comments Found
68AI Induced Psychosis: A shallow investigation
6d
0
24Discovering Backdoor Triggers
14d
0
11Optimally Combining Probe Monitors and Black Box Monitors
1mo
1
5What is the functional role of SAE errors?
2mo
0
21SHIFT relies on token-level features to de-bias Bias in Bios probes
5mo
0