AI ALIGNMENT FORUMTags
AF

Gears-Level

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
Gears-Level
Random Tag
Contributors
2Ruben Bloom
1brook
1B Jacobs

A gears-level model is 'well-constrained' in the sense that there is a strong connection between each of the things you observe-- it would be hard for you to imagine that one of the variables could be different while all of the others remained the same.

Related Tags: Anticipated Experiences, Double-Crux, Empiricism, Falsifiability, Map and Territory


The term gears-level was first described on LW in the post "Gears in Understanding":...

(Read More)

Posts tagged Gears-Level
Most Relevant
3
22Toward a New Technical Explanation of Technical Explanation
Abram Demski
5y
2
0
56Attempted Gears Analysis of AGI Intervention Discussion With Eliezer
Zvi
1y
0
1
46Evolution of Modularity
johnswentworth
3y
5
1
15Abstraction, Evolution and Gears
johnswentworth
3y
4
0
55interpreting GPT: the logit lens
nostalgebraist
2y
10
1
38Current themes in mechanistic interpretability research
Lee Sharkey, Sid Black, Beren Millidge
3mo
2
1
32Decision Transformer Interpretability
Joseph Bloom, Paul Colognese
2d
2
0
11Beware of black boxes in AI alignment research
Vladimir Slepnev
5y
0
1
17Value Formation: An Overarching Model
Thane Ruthenis
3mo
10
Add Posts