Help out Redwood Research’s interpretability team by finding heuristics implemented by GPT-2 small — AI Alignment Forum