AI ALIGNMENT FORUM
AF

Wikitags

Abstraction

Edited by satpugnet, adamShimi last updated 25th Jul 2025

Abstraction is the process of simplifying a system by capturing only the essential features needed for your purpose, while deliberately ignoring irrelevant details. In AI alignment, effective abstraction means creating models or concepts that genuinely reflect what matters for reasoning or control, not just convenient proxies. If the abstraction misses important structure, it can fail dramatically when optimized or applied in new situations. The challenge is to develop abstractions that remain valid and useful, even as systems scale or face new pressures.

(This is a stub, please rewrite if you have a better tag description).

Subscribe
2
Subscribe
2
Discussion0
Discussion0
Posts tagged Abstraction
11What is Abstraction?
johnswentworth
6y
2
10Abstraction = Information at a Distance
johnswentworth
5y
0
64Alignment By Default
johnswentworth
5y
72
29Public Static: What is Abstraction?
johnswentworth
5y
2
30Writing Causal Models Like We Write Programs
johnswentworth
5y
1
20Pointing to a Flower
johnswentworth
5y
8
23(A -> B) -> A in Causal DAGs
johnswentworth
6y
6
18Motivating Abstraction-First Decision Theory
johnswentworth
5y
15
11Trace README
johnswentworth
5y
0
14Logical Representation of Causal Models
johnswentworth
6y
0
14The Indexing Problem
johnswentworth
5y
0
16Cartesian Boundary as Abstraction Boundary
johnswentworth
5y
1
11Causal Abstraction Toy Model: Medical Sensor
johnswentworth
6y
2
15Formulating Reductive Agency in Causal Models
johnswentworth
6y
0
15Problems Involving Abstraction?
Q
johnswentworth
5y
Q
8
Load More (15/53)
Add Posts