Extracting and Evaluating Causal Direction in LLMs' Activations — AI Alignment Forum