Large language models learn to represent the world
There's a nice recent paper whose authors did the following: 1. train a small GPT model on lists of moves from Othello games; 2. verify that it seems to have learned (in some sense) to play Othello, at least to the extent of almost always making legal moves; 3. use...