The Pointers Problem

The pointers problem refers to the fact that most humans would rather have an AI that acts based on real-world human values, not just human estimates of their own values – and that the two will be different in many situations, since humans are not all-seeing or all-knowing[citation needed].It was introduced in a post with the same name.

Applied to Robust Delegation by Abram Demski at 1y
Created by Abram Demski at 1y