[ASoT] Finetuning, RL, and GPT's world prior — AI Alignment Forum