x

AI ALIGNMENT FORUM

AF

research_prime_space — AI Alignment Forum

research_prime_space

Top postsTop post

research_prime_space

Message

52

Ω

6

5

16

9y

research_prime_space

52

Ω

6

9y

Penalize Model Complexity Via Self-Distillation

When you self-distill a model (e.g. train a new model using predictions from your old model), the resulting model represents a less complex function. After many rounds of self-distillation, you essentially end up with a constant function. This paper makes the above more precise. Anyway, if you apply multiple rounds...

Apr 4, 2023•15