This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Self Fulfilling/Refuting Prophecies
•
Applied to
FixDT
by
Abram Demski
6d
ago
•
Applied to
Sexual Abuse attitudes might be infohazardous
by
Chipmonk
2mo
ago
•
Applied to
Stop-gradients lead to fixed point predictions
by
Johannes Treutlein
10mo
ago
•
Applied to
Proper scoring rules don’t guarantee predicting fixed points
by
Johannes Treutlein
1y
ago
•
Applied to
How evolutionary lineages of LLMs can plan their own future and act on these plans
by
Roman Leventov
1y
ago
•
Applied to
Training goals for large language models
by
Johannes Treutlein
1y
ago
•
Applied to
Conditioning Generative Models for Alignment
by
Arun Jose
1y
ago
•
Applied to
Encouragement to Instill Confidence?
by
Jim Babcock
2y
ago
•
Applied to
Descriptive vs. prescriptive optimism
by
Yoav Ravid
2y
ago
•
Applied to
Politics is way too meta
by
DirectedEvolution
2y
ago
•
Applied to
Luck II: Expecting White Swans
by
DirectedEvolution
2y
ago
•
Applied to
Decision Theories: A Semi-Formal Analysis, Part II
by
DirectedEvolution
2y
ago
•
Applied to
Random Thoughts on Predict-O-Matic
by
DirectedEvolution
2y
ago
•
Applied to
Defining Myopia
by
DirectedEvolution
2y
ago
•
Applied to
Notes on Optimism, Hope, and Trust
by
DirectedEvolution
2y
ago
•
Applied to
An example of self-fulfilling spurious proofs in UDT
by
DirectedEvolution
2y
ago
•
Applied to
Fifty Shades of Self-Fulfilling Prophecy
by
DirectedEvolution
2y
ago
•
Applied to
Omega and self-fulfilling prophecies
by
DirectedEvolution
2y
ago
•
Applied to
Self-fulfilling values of time
by
DirectedEvolution
2y
ago