AI ALIGNMENT FORUM

Ran W

Posts

Why do we need RLHF? Imitation, Inverse RL, and the role of reward (score: 11, posted 1y ago, 0 comments)