This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
lewis smith
Posts
Sorted by New
8
lewis smith's Shortform
7mo
0
29
A Problem to Solve Before Building a Deception Detector
1mo
0
92
The ‘strong’ feature hypothesis could be wrong
7mo
0
39
Improving Dictionary Learning with Gated Sparse Autoencoders
11mo
32
40
[Full Post] Progress Update #1 from the GDM Mech Interp Team
11mo
3
36
[Summary] Progress Update #1 from the GDM Mech Interp Team
11mo
0
Wikitag Contributions
Comments
Sorted by
Newest