AI ALIGNMENT FORUM

Simulator Theory

Edited by Jozdien, TristanTrim, et al. Last updated 20th May 2025.

Simulator Theory (in the context of AI) is an ontology or frame for understanding how large generative models, such as the GPT series from OpenAI, work. Broadly, it views these models as simulating a learned distribution with varying degrees of fidelity. In the case of language models trained on a large corpus of text, that distribution is the mechanics underlying the process that generated the corpus, which may be understood as the people writing or the dynamics they write about.

It can also refer to an alignment research agenda that deals with better understanding simulator conditionals, the effects of downstream training, and alignment-relevant properties such as myopia and agency in the context of language models, as well as with using language models as alignment research accelerators. See also: Cyborgism
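
As a rough illustration of the framing above, the idea of a model rolling a learned distribution forward under a conditional can be sketched with a toy bigram model. This is only a minimal sketch and not how GPT-style models are implemented: the corpus, the function names `fit_bigram` and `simulate`, and the word-level bigram assumption are all invented here for illustration.

```python
import random
from collections import Counter, defaultdict


def fit_bigram(corpus):
    """Estimate P(next word | current word) by counting adjacent word pairs."""
    counts = defaultdict(Counter)
    for text in corpus:
        words = text.split()
        for cur, nxt in zip(words, words[1:]):
            counts[cur][nxt] += 1
    return counts


def simulate(counts, prompt, steps, rng):
    """Roll the learned distribution forward from a prompt (the conditional)."""
    words = prompt.split()
    for _ in range(steps):
        options = counts.get(words[-1])
        if not options:  # no continuation was ever observed for this word
            break
        nxt = rng.choices(list(options), weights=list(options.values()))[0]
        words.append(nxt)
    return words


# Hypothetical toy corpus standing in for "a large corpus of text".
corpus = [
    "the knight draws his sword",
    "the knight rides north",
    "the scholar draws a map",
]
counts = fit_bigram(corpus)

# The prompt conditions the rollout; the seed only fixes the random choices.
rollout = simulate(counts, "the scholar", steps=4, rng=random.Random(0))
print(" ".join(rollout))
```

Every step of the rollout is a transition seen somewhere in the corpus, but the rollout as a whole need not match any single source sentence: depending on the random choices, the prompt "the scholar" can continue as "draws a map" or drift into "draws his sword". That is a very loose analogue of simulating a distribution with limited fidelity.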

Posts tagged Simulator Theory
Simulators, by janus (3y; 141 karma, 90 comments)
Conditioning Predictive Models: Large language models as predictors, by evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson, kcwoolverton (3y; 47 karma, 3 comments)
Why Simulator AIs want to be Active Inference AIs, by Jan_Kulveit, rosehadshar (2y; 31 karma, 4 comments)
The Waluigi Effect (mega-post), by Cleo Nardo (3y; 58 karma, 26 comments)
How to Control an LLM's Behavior (why my P(DOOM) went down), by RogerDearnaley (2y; 19 karma, 0 comments)
Goodbye, Shoggoth: The Stage, its Animatronics, & the Puppeteer – a New Metaphor, by RogerDearnaley (2y; 8 karma, 0 comments)
Simulacra are Things, by janus (3y; 21 karma, 7 comments)
Conditioning Generative Models for Alignment, by Jozdien (3y; 28 karma, 8 comments)
'simulator' framing and confusions about LLMs, by Beth Barnes (3y; 53 karma, 1 comment)
[Simulators seminar sequence] #1 Background & shared assumptions, by Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer, remember (3y; 19 karma, 3 comments)
Agents vs. Predictors: Concrete differentiating factors, by evhub (3y; 22 karma, 1 comment)
GPTs are Predictors, not Imitators, by Eliezer Yudkowsky (2y; 109 karma, 12 comments)
Inner Misalignment in "Simulator" LLMs, by Adam Scherlis (3y; 39 karma, 3 comments)
Two problems with ‘Simulators’ as a frame, by ryan_greenblatt (3y; 42 karma, 9 comments)
Conditioning Predictive Models: Outer alignment via careful conditioning, by evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson, kcwoolverton (3y; 37 karma, 6 comments)