Comments

How hard was it to find the examples of goal misgeneralization? Did the results take much “coaxing”?