x

AI ALIGNMENT FORUM

AF

adamk — AI Alignment Forum

Adam Khoja

Top postsTop post

Adam Khoja

Message

Math and CS undergraduate at UC Berkeley

165

Ω

21

4

6

5y

Adam Khoja

Math and CS undergraduate at UC Berkeley

The Bitter Lesson for AI Safety Research

Read the associated paper "Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?": https://arxiv.org/abs/2407.21792 Focus on safety problems that aren’t solved with scale. Benchmarks are crucial in ML to operationalize the properties we want models to have (knowledge, reasoning, ethics, calibration, truthfulness, etc.). They act as a criterion to judge...

Aug 2, 2024•58