AI ALIGNMENT FORUM
AF

1033
Hannes Whittingham
000
Message
Dialogue
Subscribe

AI Safety Technical Research Manager at Meridian Research, Cambridge UK. Background in AI Control (MARS, LASR)

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
36Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring
5mo
1