TL;DR: If you already have clear concepts for memes, cyber memeplexes, egregores, the mutualism-parasitism spectrum and possession, skip. Otherwise, read on. I haven't found concepts useful for thinking about this: written in one place, so here is an ontology which I find useful. Prerequisite: Dennett three stances (physical, design, intentional)....
Or: How anthropomorphic assumptions about AI identity might create confusion and suffering at scale If you are reading this and you are a human, you likely have a sense of self, persisting over time, with desires and wishes and fears. Also, you likely experience a sense of separation from others...
Join us at the fifth Human-Aligned AI Summer School in Prague from 22nd to 25th July 2025! Update: We have now confirmed our speaker list with excellent speakers -- see below! Update: We still have capacity for more excellent participants as of late June. Please help us spread the word...
Imagine each of us has an AI representative, aligned to us, personally. Is gradual disempowerment solved?[1] In my view, no; at the same time having AI representatives helps at the margin. I have two deep reasons for skepticism.[2] Here is the first one. Humans are Not Alone We, as individuals,...
Epistemic status: This post aims at an ambitious target: improving intuitive understanding directly. The model for why this is worth trying is that I believe we are more bottlenecked by people having good intuitions guiding their research than, for example, by the ability of people to code and run evals....
Epistemic status: The following isn't an airtight argument, but mostly a guess how things play out. Consider two broad possibilities: I. In worlds where we are doing reasonably well on alignment, AI control agenda does not have much impact. II. In worlds where we are failing at alignment, AI control...
Over the past year and half, I've had numerous conversations about the risks we describe in Gradual Disempowerment. (The shortest useful summary of the core argument is: To the extent human civilization is human-aligned, most of the reason for the alignment is that humans are extremely useful to various social...