AI ALIGNMENT FORUM

A Ray — AI Alignment Forum

983

Ω

157

8

159

9y

Steganography in Chain of Thought Reasoning

Here I describe a possible phenomenon, steganography in chain-of-thought reasoning, in which a system doing multi-stage reasoning in natural language encodes hidden information in its outputs that humans cannot observe but that it can use to boost its performance on some task. I think this could happen...

Aug 8, 2022 • 63

Alex Ray's Shortform

Nov 8, 2020 • 2