zeshen — AI Alignment Forum

A newcomer’s guide to the technical AI safety field

This post was written during Refine. Thanks to Jonathan Low, Linda Linsefors, Koen Holtman, Aaron Scher, and Nicholas Kees Dupuis for helpful discussion and feedback. Disclaimer: This post reflects my current understanding of the field and may not be an accurate representation of it. Feel free to comment if you...

Nov 4, 2022 · 44

Embedding safety in ML development

This post was written as part of Refine. Thanks to Adam Shimi, Alexander Gietelink Oldenziel, and Vanessa Kosoy for helpful discussion and feedback. Summary This post aims to: * Advocate for embedding safety into development of machine learning models * Propose a framing on how to think about safety, where...

Oct 31, 2022 · 24

My Thoughts on the ML Safety Course

This summary was written as part of Refine. The ML Safety Course was created by Dan Hendrycks at the Center for AI Safety. Thanks to Adam Shimi and Thomas Woodside for helpful feedback. Overview Background I recently completed the ML Safety Course by watching the videos and browsing through the...

Sep 27, 2022 · 50

Levels of goals and alignment

This post was written as part of Refine. Thanks to Adam Shimi, Lucas Teixeira, Linda Linsefors, and Jonathan Low for helpful feedback and comments. Epistemic status: highly uncertain. This post reflects my understanding of the terminology and may not reflect the general consensus of AI alignment researchers (if any). Motivation...

Sep 16, 2022 · 27

What if we approach AI safety like a technical engineering safety problem

This post has been written for the second Refine blog post day, at the end of the first week of iterating on ideas and concretely aiming at the alignment problem. Thanks to Adam Shimi, Paul Bricman, and Daniel Clothiaux for helpful discussion and comments. Introduction This post aims to provide...

Aug 20, 2022 · 36

I missed the crux of the alignment problem the whole time

This post has been written for the first Refine blog post day, at the end of the week of readings, discussions, and exercises about epistemology for doing good conceptual research. Thanks to Adam Shimi for helpful discussion and comments. I first got properly exposed to AI alignment ~1-2 years ago....

Aug 13, 2022 · 53

Chin Ze Shen
