x

AI ALIGNMENT FORUM

AF

chanind — AI Alignment Forum

chanind

Top postsTop post

chanind

Message

495

Ω

8

12

24

3y

chanind

495

Ω

8

3y

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders

Adam Karvonen*, Can Rager*, Johnny Lin*, Curt Tigges*, Joseph Bloom*, David Chanin, Yeu-Tong Lau, Eoin Farrell, Arthur Conmy, Callum McDougall, Kola Ayonrinde, Matthew Wearden, Samuel Marks, Neel Nanda *equal contribution TL;DR * We are releasing SAE Bench, a suite of 8 diverse sparse autoencoder (SAE) evaluations including unsupervised metrics and...

Dec 11, 2024•82