x

AI ALIGNMENT FORUM

AF

elandgre — AI Alignment Forum

elandgre

elandgre

Message

53

2

1

4y

elandgre

53

4y

Reflection Mechanisms as an Alignment target: A follow-up survey

by Marius Hobbhahn, elandgre, and Beth Barnes

This is the second of three posts (part I) about surveying moral sentiments related to AI alignment. This work was done by Marius Hobbhahn and Eric Landgrebe under the supervision of Beth Barnes as part of the AI safety camp 2022. TL;DR: We find that the results of our first...

Oct 5, 2022•21

Reflection Mechanisms as an Alignment target: A survey

by Marius Hobbhahn, elandgre, and Beth Barnes

This is a product of the 2022 AI Safety Camp. The project has been done by Marius Hobbhahn and Eric Landgrebe under the supervision of Beth Barnes. We would like to thank Jacy Reese Anthis and Tyna Eloundou for detailed feedback. You can find the google doc for this post...

Jun 22, 2022•32