AI ALIGNMENT FORUMTags
AF

Reinforcement Learning

EditHistorySubscribe

Help improve this page (1 flag)

EditHistorySubscribe

Help improve this page (1 flag)

Reinforcement Learning

Exploration and Optimization

Further Reading & References

Contributors

2Miranda Dixon-Luinenburg

1Roman Leventov

Within the field of machine learning, reinforcement learning refers to the study of how to train agents to complete tasks by updating ("reinforcing") the agents with feedback signals.

Related: Inverse Reinforcement Learning, Machine learning, Friendly AI, Game Theory, Prediction, Agency.

Consider an agent that receives an input informing the agent of the environment's state. Based only on that information, the agent has to make a decision regarding which action to take, from a set, which will influence the state of the environment. This action will in itself change the state of the environment, which will result in a new input, and so on, each time also presenting the agent with the reward (or reinforcement signal) relative to its actions in the environment. In "policy gradient" approaches, the reinforcement signal is often used to update the agent (the "policy"), although sometimes an agent will do limited online (model-based) heuristic search to instead optimize the reward signal + heuristic evaluation. ...

Posts tagged Reinforcement Learning

8

48Think carefully before calling RL policies "agents"

1y

9

7

93Reward is not the optimization target

2y

88

4

28Draft papers for REALab and Decoupled Approval on tampering

Jonathan Uesato, Ramana Kumar

4y

2

2

64EfficientZero: How It Works

3y

2

3

14Remaking EfficientZero (as best I can)

2y

0

1

12AGI will have learnt utility functions

2y

1

1

14Why almost every RL agent does learned optimization

1y

3

2

13Interpreting the Learning of Deceit

Roger Dearnaley

7mo

2

1

80Models Don't "Get Reward"

2y

4

2

32Jitters No Evidence of Stupidity in RL

3y

0

0

76DeepMind: Generally capable agents emerge from open-ended play

Daniel Kokotajlo

3y

11

1

45Shard Theory: An Overview

2y

2

2

39LeCun’s “A Path Towards Autonomous Machine Intelligence” has an unsolved technical alignment problem

1y

23

1

43Reward Is Not Enough

3y

12

1

43My take on Vanessa Kosoy's take on AGI safety

3y

8