AI Safety Camp
• Applied to Podcast interview series featuring Dr. Peter Park by jacobhaimes 1mo ago
• Applied to INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park by jacobhaimes 1mo ago
• Applied to A Review of Weak to Strong Generalization [AI Safety Camp] by sevdeawesome 2mo ago
• Applied to INTERVIEW: StakeOut.AI w/ Dr. Peter Park by jacobhaimes 2mo ago
• Applied to Paper review: "The Unreasonable Effectiveness of Easy Training Data for Hard Tasks" by Vassil Tashev 2mo ago
• Applied to Inducing human-like biases in moral reasoning LMs by artkpv 2mo ago
• Applied to Why I take short timelines seriously by Nicholas Kees Dupuis 3mo ago
• Applied to This might be the last AI Safety Camp by Remmelt Ellen 3mo ago
• Applied to Interview: Applications w/ Alice Rigg by jacobhaimes 4mo ago
• Applied to Funding case: AI Safety Camp by Remmelt Ellen 4mo ago
• Applied to AISC project: How promising is automating alignment research? (literature review) by Bogdan Ionut Cirstea 5mo ago
• Applied to AISC 2024 - Project Summaries by Nicky Pochinkov 5mo ago
• Applied to AISC project: TinyEvals by Jett Janiak 5mo ago
• Applied to AI Safety Camp 2024 by Linda Linsefors 5mo ago
• Applied to AISC Project: Benchmarks for Stable Reflectivity by Jacques Thibodeau 5mo ago
• Applied to AISC Project: Modelling Trajectories of Language Models by Nicky Pochinkov 5mo ago