x

AI ALIGNMENT FORUM

AF

ukc10014 — AI Alignment Forum

ukc10014

Top postsTop post

ukc10014

Message

240

12

14

4y

ukc10014

240

4y

Collective Identity

by Niki Dupuis, ukc10014, and Garrett Baker

Thanks to Simon Celinder, Quentin Feuillade--Montixi, Nora Ammann, Clem von Stengel, Guillaume Corlouer, Brady Pelkey and Mikhail Seleznyov for feedback on drafts. This post was written in connection with the AI Safety Camp. Executive Summary: This document proposes an approach to corrigibility that focuses on training generative models to function...

May 18, 2023•59