AI ALIGNMENT FORUM

GMM — AI Alignment Forum

Top posts

1044

Ω

25

5

64

1

4y

The Bitter Lesson for AI Safety Research

by adamk, Richard Ren, Dan H, and GMM

Read the associated paper, "Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?": https://arxiv.org/abs/2407.21792

Focus on safety problems that aren’t solved with scale.

Benchmarks are crucial in ML to operationalize the properties we want models to have (knowledge, reasoning, ethics, calibration, truthfulness, etc.). They act as a criterion to judge...

Aug 2, 2024 • 58