AI ALIGNMENT FORUM
AF

24
Benjamin Hilton
Ω69100
Message
Dialogue
Subscribe

Head of Alignment at UK AI Security Institute (AISI). Previously 80,000 Hours, HM Treasury, Cabinet Office, Department for International Trade, Imperial College London.

Sequences

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
The Alignment Project Research Agenda
UK AISI Alignment Team: Debate Sequence
No Comments Found
No wikitag contributions to display.
6Research Areas in Methods for Post-training and Elicitation (The Alignment Project by UK AISI)
2mo
0
7Research Areas in Benchmark Design and Evaluation (The Alignment Project by UK AISI)
2mo
0
3Research Areas in Probabilistic Methods (The Alignment Project by UK AISI)
2mo
0
8Research Areas in Evaluation and Guarantees in Reinforcement Learning (The Alignment Project by UK AISI)
2mo
0
13The Alignment Project by UK AISI
2mo
0
35An alignment safety case sketch based on debate
5mo
15
47UK AISI’s Alignment Team: Research Agenda
5mo
2
33A sketch of an AI control safety case
8mo
0
38Automation collapse
1y
7