This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
AI Safety Camp
•
Applied to
AISC project: How promising is automating alignment research? (literature review)
by
Bogdan Ionut Cirstea
7d
ago
•
Applied to
AISC 2024 - Project Summaries
by
Nicky Pochinkov
8d
ago
•
Applied to
AISC project: TinyEvals
by
Jett Janiak
13d
ago
•
Applied to
AI Safety Camp 2024
by
Linda Linsefors
17d
ago
•
Applied to
AISC Project: Benchmarks for Stable Reflectivity
by
Jacques Thibodeau
22d
ago
•
Applied to
AISC Project: Modelling Trajectories of Language Models
by
Nicky Pochinkov
22d
ago
•
Applied to
The Science Algorithm AISC Project
by
Johannes C. Mayer
22d
ago
•
Applied to
AISC project: SatisfIA – AI that satisfies without overdoing it
by
Jobst Heitzig
24d
ago
•
Applied to
Control Symmetry: why we might want to start investigating asymmetric alignment interventions
by
domenicrosati
24d
ago
•
Applied to
Jimmy Apples, source of the rumor that OpenAI has achieved AGI internally, is a credible insider.
by
Jorterder
2mo
ago
•
Applied to
Projects I would like to see (possibly at AI Safety Camp)
by
Linda Linsefors
2mo
ago
•
Applied to
Apply to lead a project during the next virtual AI Safety Camp
by
Linda Linsefors
3mo
ago
•
Applied to
How teams went about their research at AI Safety Camp edition 8
by
Remmelt Ellen
3mo
ago
•
Applied to
"Wanting" and "liking"
by
Mateusz Bagiński
3mo
ago
•
Applied to
The Control Problem: Unsolved or Unsolvable?
by
Remmelt Ellen
5mo
ago
•
Applied to
Inherently Interpretable Architectures
by
Remmelt Ellen
5mo
ago
•
Applied to
Positive Attractors
by
Remmelt Ellen
5mo
ago