Finding the estimate of the value of a state in RL agents — AI Alignment Forum