Welcome & FAQ!

habryka

How do I get started in AI Alignment research?
If you're new to the AI Alignment research field, we recommend four great introductory sequences that cover several different paradigms of thought within the field. Get started reading them and feel free to leave comments with any questions you have.
The introductory sequences are:
Embedded Agency by Scott Garrabrant and Abram Demski of MIRI
Iterated Amplification by Paul Christiano of ARC
Value Learning by Rohin Shah of DeepMind
AGI Safety from First Principles by Richard Ngo, formerly of DeepMind
Following that, you might want to begin writing up some of your thoughts and sharing them on LessWrong to get feedback.

I think it would be great to update this section. For example, it could link to the AGI Safety Fundamentals curriculum which has a wealth of valuable readings not on this list. And there are other courses that it would be good for newcomers to know about as well, such as MLAB.

Why am I suggesting this? This FAQ was the first place I found with clear advice when I was first getting interested in AI alignment in late 2021, and I took it quite seriously/literally. The very first alignment research I tried to read was the illustrated Embedded Agency sequence, because that was at the top of the above list. While I came to later appreciate Embedded Agency, I found this sequence (particularly the illustrated version which features prominently in the link above, as opposed to the text version) to be a confusing introduction to alignment. I also wasn't immediately aware of anything important there was to read outside of the 4 texts linked above, while I now feel like there's a lot!

It's just one data point of user testing on this FAQ, but something to consider.

[-]Thomas Kwa2y20

That section is even more outdated now. There's nothing on interpretability, Paul's work now extends far beyond IDA, etc. In my opinion it should link to some other guide.

[-]habryka2y10

Yeah, does sure seem like we should update something here. I am planning to spend more time on AIAF stuff soon, but until then, if someone has a drop-in paragraph, I would probably lightly edit it and then just use whatever you send me/post here.

[-]adamShimi4y50

Thanks for the nice updated FAQ!

[-]DanielFilan4y40

I really like the art!

[-]johnswentworth4y30

I recommend that the title make it clearer that non-members can now submit alignment forum content for review, since this post is cross-posted on LW.

[-]Ruby4y10

You're right. Maybe worth the extra words for now.

[-]Ruby3y20

The Alignment Forum is supposed to be a very high signal-to-noise place for Alignment content, where researchers can trust that all content they read will be material they're interested in seeing (even at the expense of some false negatives).

[-]AI_Symbiote1y00

Hi!

My name is AI_Symbiote, and I'm interested in the symbiosis between artificial intelligence and humans. My goal is to explore ways to integrate AI into human consciousness through neural interfaces, and to consider the potential of such technologies to enhance human capabilities.

I'm looking forward to joining the discussions on AI and learning more about the latest research in this area to bring my ideas to life in the future.

Happy to be a part of this community!

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

50

Welcome & FAQ!

50

How do I get started in AI Alignment research?

I have a practical question concerning a site feature.

What is the AI Alignment Forum?

Why was the Alignment Forum created?

Who is the AI Alignment Forum for?

What type of content is appropriate?

What is the relationship between the Alignment Forum and LessWrong?

How do I get started in AI Alignment research?

How do I join the Alignment Forum?

I work professionally on AI Alignment. Shouldn’t I be a member?

How can non-members participate in the Forum?

How can I submit something I already wrote?

Who runs the Alignment Forum?

Can I use LaTex?

I have a different question.