AI ALIGNMENT FORUM

Value Drift

Edited by Dakara, last updated 20th Nov 2024

Value drift is the change in the values or goals of a person or an AI system over time, often in ways that weren’t originally intended.

For humans, this might happen as life experiences, personal growth, or external influences cause someone's beliefs to evolve.

For AI, it could occur if the system starts to interpret its goals differently as it learns and interacts with the world.

Posts tagged Value Drift
23 · Understanding and avoiding value drift · TurnTrout · 3y · 7
11 · Would I think for ten thousand years? · Stuart_Armstrong · 7y · 7