SHIFT relies on token-level features to de-bias Bias in Bios probes — AI Alignment Forum