x

AI ALIGNMENT FORUM

AF

L Rudolf L — AI Alignment Forum

L Rudolf L

Top postsTop post

L Rudolf L

Message

My blog is here. You can subscribe for new posts there.

My personal site is here.

My X/Twitter is here

You can contact me using this form.

3195

Ω

18

24

86

5y

L Rudolf L

My blog is here. You can subscribe for new posts there.

My personal site is here.

My X/Twitter is here

You can contact me using this form.

Understanding and controlling auto-induced distributional shift

This post was written under Evan Hubinger’s direct guidance and mentorship, as a part of the Stanford Existential Risks Initiative ML Alignment Theory Scholars (MATS) program. TL;DR: The input distribution of a supervised machine learning system is modeled as an unchanging thing, and even in reinforcement learning there are types...

Dec 13, 2021•33