AI ALIGNMENT FORUM

Value Learning, Wireheading, AI

Value extrapolation vs Wireheading

by Stuart Armstrong
17th Jun 2022

This is a talk given by Rebecca Gorman and Stuart Armstrong at the CHAI 2022 Asilomar Conference. We present an example of AI wireheading (an AI taking over its own reward channel) and show how value extrapolation can be used to combat it.

https://www.youtube.com/watch?v=REUanSy0SgU