Aidan Ewart

Undergraduate student studying Mathematics @ University of Bristol.

Interested in & persuing a career in technical AI safety.

Wiki Contributions

Comments

Yep! We are planning to do exactly that for (at least) the models we focus on in the paper (Pythia-70m + Pythia-410m), and probably also GPT2 small. We are also working on cleaning up our codebase (https://github.com/HoagyC/sparse_coding) and implementing some easy dictionary training solutions.