AI ALIGNMENT FORUM
AF

Wikitags

Model Diffing

Edited by Clément Dumas last updated 30th Jun 2025

Model diffing is the study of mechanistic changes introduced during fine-tuning - essentially, understanding what makes a fine-tuned model different from its base model internally.

Subscribe
Subscribe
Discussion0
Discussion0
Posts tagged Model Diffing
36What We Learned Trying to Diff Base and Chat Models (And Why It Matters)
Clément Dumas, Julian Minder, Neel Nanda
10d
0
Add Posts