the QACI alignment plan: table of contents

Tamsin Leake

the QACI alignment plan: table of contents

by Tamsin Leake

1 min read21st Mar 2023No comments

26

Research AgendasAI

Frontpage

This is a linkpost for https://carado.moe/qaci.html

this post aims to keep track of posts relating to the question-answer counterfactual interval proposal for AI alignment, abbreviated "QACI" and pronounced "quashy". i'll keep it updated to reflect the state of the research.

this research is primarily published on the Orthogonal website and discussed on the Orthogonal discord.

as a top-level view of QACI, you might want to start with:

the set of all posts relevant to QACI includes:

as overviews of QACI and how it's going:
on the formal alignment perspective within which it fits:
on the blob location problem:
on QACI as an implementation of long reflection / CEV:
- CEV can be coherent enough
- some thoughts about terminal alignment
on formalizing the QACI formal goal:
- a rough sketch of formal aligned AI using QACI with some actual math
- one-shot AI, delegating embedded agency and decision theory, and one-shot QACI
on how a formally aligned AI would actually run over time:
- AI alignment curves
- before the sharp left turn: what wins first?
on the metaethics grounding QACI:
on my view of the AI alignment research field within which i'm doing formal alignment:
- my current outlook on AI risk mitigation
- a casual intro to AI doom and alignment

Research AgendasAI

Frontpage

26

Mentioned in

99Shallow review of live agendas in alignment & safety

18Orthogonal's Formal-Goal Alignment theory of change

16formal alignment: what it is, and some proposals

13formalizing the QACI alignment formal-goal

4an Evangelion dialogue explaining the QACI alignment plan

New Comment

Moderation Log