I have a question about the conjecture at the end of Direction 17.5. Let U1 be a utility function with values in [0,1] and let f:[0,1]→[0,1] be a strictly monotonous function. Then U1 and U2=f∘U1 have the same maxima. f can be non-linear, e.g. f(x)=x2. Therefore, I wonder if the condition u(y)=αv(y)+β should be weaker.
Moreover, I ask myself if it is possible to modify U1 by a small amount at a place far away from the optimal policy such that π is still optimal fo... (read more)
I have a question about the conjecture at the end of Direction 17.5. Let U1 be a utility function with values in [0,1] and let f:[0,1]→[0,1] be a strictly monotonous function. Then U1 and U2=f∘U1 have the same maxima. f can be non-linear, e.g. f(x)=x2. Therefore, I wonder if the condition u(y)=αv(y)+β should be weaker.
Moreover, I ask myself if it is possible to modify U1 by a small amount at a place far away from the optimal policy such that π is still optimal fo... (read more)