AI ALIGNMENT FORUM
AF

108
Wikitags

Incentives

Edited by plex, Yoav Ravid, et al. last updated 14th Oct 2021

An Incentive is a motivating factor, such as monetary reward, the risk of legal sanctions, or social feedback. Many systems are best understood by looking at the incentives of the people with power over them.

Inadequate Equilibria covers many problems that arise when there are poor incentives.

Related Pages: Game Theory, Mechanism Design, Moloch, Moral Mazes

Subscribe
Discussion
2
Subscribe
Discussion
2
Posts tagged Incentives
29A framework for thinking about AI power-seeking
Joe Carlsmith
1y
11
36Progress on Causal Influence Diagrams
tom4everitt
4y
6
15Reward Hacking from a Causal Perspective
tom4everitt, Francis Rhys Ward, sbenthall, James Fox, mattmacdermott, RyanCarey
2y
2
12Incentives from a causal perspective
tom4everitt, James Fox, RyanCarey, mattmacdermott, sbenthall, Jonathan Richens
2y
0
Add Posts