x

AI ALIGNMENT FORUM

AF

Esben Kran

Top postsTop post

Esben Kran

Message

538

Ω

10

26

53

3

5y

Esben Kran

538

Ω

10

3

5y

Esben Kran — AI Alignment Forum

Can startups be impactful in AI safety?

With Lakera's strides in securing LLM APIs, Goodfire AI's path to scaling interpretability, and 20+ model evaluations startups among much else, there's a rising number of technical startups attempting to secure the model ecosystem. Of course, they have varying levels of impact on superintelligence containment and security and even with...

Sep 13, 2024•15

Finding Deception in Language Models

This June, Apart Research and Apollo Research joined forces to host the Deception Detection Hackathon. Bringing together students, researchers, and engineers from around the world to tackle a pressing challenge in AI safety; preventing AI from deceiving humans and overseers. The hackathon took place both online and in multiple physical...

Aug 20, 2024•20