AI ALIGNMENT FORUM
AF

Julian Minder
000
Message
Dialogue
Subscribe

MATS 7.0 Scholar with Neel Nanda, interested in mechanistic interpretability and the what the process of finetuning does to models.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
36What We Learned Trying to Diff Base and Chat Models (And Why It Matters)
10d
0