AI ALIGNMENT FORUM: Tags

Formal Proof

• Applied to Searching for Impossibility Results or No-Go Theorems for provable safety. by Maelstrom 1mo ago
• Applied to An Opinionated Look at Inference Rules by Gianluca Calcagni 2mo ago
• Applied to Limitations on Formal Verification for AI Safety by Andrew Dickson 3mo ago
• Applied to Video Intro to Guaranteed Safe AI by Mike Vaiana 4mo ago
• Applied to Compact Proofs of Model Performance via Mechanistic Interpretability by Jason Gross 5mo ago
• Applied to Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems by Chipmonk 6mo ago
• Applied to A list of core AI safety problems and how I hope to solve them by Helder S Ribeiro 8mo ago
• Applied to Planning to build a cryptographic box with perfect secrecy by Lysandre Terrisse 10mo ago
• Applied to Social Choice Theory and Logical Handshakes by StrivingForLegibility 10mo ago
• Applied to Eleuther releases Llemma: An Open Language Model For Mathematics by mako yass 1y ago
• Applied to I bet $500 on AI winning the IMO gold medal by 2026 by Kelvin Santos 1y ago
• Applied to Roadmap for a collaborative prototype of an Open Agency Architecture by Deger Turan 2y ago
• Applied to What Programming Language Characteristics Would Allow Provably Safe AI? by Noosphere89 2y ago
• Applied to Davidad's Bold Plan for Alignment: An In-Depth Explanation by Charbel-Raphael Segerie 2y ago
• Applied to Squeezing foundations research assistance out of formal logic narrow AI. by Lauren (often wrong) 2y ago
• Applied to Speedrunning 4 mistakes you make when your alignment strategy is based on formal proof by Quinn Dougherty 2y ago