Backdoor awareness and misaligned personas in reasoning models — AI Alignment Forum