What’s up with LLMs representing XORs of arbitrary features? — AI Alignment Forum