AI ALIGNMENT FORUM
AF

1427
Wikitags

Natural Abstraction

Edited by Raemon last updated 10th Oct 2022

The Natural Abstraction hypothesis says that:

Our physical world abstracts well: for most systems, the information relevant “far away” from the system (in various senses) is much lower-dimensional than the system itself. These low-dimensional summaries are exactly the high-level abstract objects/concepts typically used by humans.

These abstractions are “natural”: a wide variety of cognitive architectures will learn to use approximately the same high-level abstract objects/concepts to reason about the world.

(from "Testing the Natural Abstraction Hypothesis")

Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged Natural Abstraction
7
101Natural Abstractions: Key Claims, Theorems, and Critiques
LawrenceC, Leon Lang, Erik Jenner
3y
9
5
40Natural Latents: The Concepts
johnswentworth, David Lorell
2y
3
2
49Natural Latents: The Math
johnswentworth, David Lorell
2y
6
2
64Alignment By Default
johnswentworth
5y
72
2
61Testing The Natural Abstraction Hypothesis: Project Intro
johnswentworth
4y
25
2
28What is a Tool?
johnswentworth, David Lorell
1y
0
2
23The Natural Abstraction Hypothesis: Implications and Evidence
CallumMcDougall
4y
3
2
29Public Static: What is Abstraction?
johnswentworth
5y
2
1
33Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
4y
5
2
24Agency As a Natural Abstraction
Thane Ruthenis
4y
1
1
13[ASoT] Natural abstractions and AlphaZero
Ulisse Mini
3y
0
2
73The Plan - 2022 Update
johnswentworth
3y
14
1
67A rough and incomplete review of some of John Wentworth's research
So8res
3y
3
1
50Natural Latents: Latent Variables Stable Across Ontologies
johnswentworth, David Lorell
2mo
0
1
31Idealized Agents Are Approximate Causal Mirrors (+ Radical Optimism on Agent Foundations)
Thane Ruthenis
2y
7
Load More (15/40)
Add Posts