Paradigm-Building for AGI Safety Research

This sequence introduces and defends an end-to-end theoretical framework for AGI safety research. Beginning with foundational principles in effective altruism, the framework moves hierarchically: it starts by formalizing the problem that AGI safety research intends to solve and proceeds to specify the progression of field-level questions that seem like they must be answered to solve the problem. This sequence is intended for new and seasoned researchers alike—my hope is that this report will help organize and streamline the individual efforts of safety researchers and facilitate crisper field-level communication, collaboration, and debate.

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

Paradigm-Building for AGI Safety Research