x

AI ALIGNMENT FORUM

AF

vonnik — AI Alignment Forum

vonnik

vonnik

Message

1

6y

vonnik

6y

Reward is not the optimization target

vonnik4y*0-11

The argument above isn’t clear to me, because I’m not sure how you’re defining your terms.

I should note that, contrary to the statement “reward is _not_, in general, that-which-is-optimized by RL agents”, by definition "reward _must be_ what is optimized for by RL agents." If they do not do that, they are not RL agents. At least, that is true based on the way the term “reward” is commonly used in the field of RL. That is what RL agents are programmed by humans to do. They do that by changing their behavior over many trials, and testing the results of ... (read more)