Do sparse autoencoders find "true features"? — AI Alignment Forum