x

AI ALIGNMENT FORUM

AF

Jan Kulveit — AI Alignment Forum

Jan Kulveit

Jan Kulveit

Message

175

Ω

34

1

20

8y

Jan Kulveit

175

Ω

34

8y

Box inversion hypothesis

This text originated from a retreat in late 2018, where researchers from FHI, MIRI and CFAR did an extended double-crux on AI safety paradigms, with Eric Drexler and Scott Garrabrant in the core. In the past two years I tried to improve it in terms of understandability multiple times, but...

Oct 20, 2020•59