This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Experiments
•
Applied to
0.202 Bits of Evidence In Favor of Futarchy
by
niplav
7d
ago
•
Applied to
Pomodoro Method Randomized Self Experiment
by
niplav
7d
ago
•
Applied to
[Paper] Hidden in Plain Text: Emergence and Mitigation of Steganographic Collusion in LLMs
by
Yohan Mathew
11d
ago
•
Applied to
Who Feels More Alone?
by
marvinscheffold
14d
ago
•
Applied to
Michael Dickens' Caffeine Tolerance Research
by
niplav
1mo
ago
•
Applied to
Inference-Only Debate Experiments Using Math Problems
by
Arjun Panickssery
2mo
ago
•
Applied to
The need for multi-agent experiments
by
Martín Soto
2mo
ago
•
Applied to
Notifications Received in 30 Minutes of Class
by
Mir
4mo
ago
•
Applied to
My hour of memoryless lucidity
by
Gunnar Zarncke
5mo
ago
•
Applied to
Claude wants to be conscious
by
Joe Kwon
6mo
ago
•
Applied to
Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders
by
Johnny Lin
6mo
ago
•
Applied to
Increasing IQ by 10 Points is Possible
by
jacobjacob
7mo
ago
•
Applied to
Exploring OpenAI's Latent Directions: Tests, Observations, and Poking Around
by
Johnny Lin
8mo
ago
•
Applied to
Some negative steganography results
by
kave
10mo
ago
•
Applied to
Please Bet On My Quantified Self Decision Markets
by
niplav
10mo
ago
•
Applied to
Extrapolating from Five Words
by
Gordon Seidoh Worley
11mo
ago
•
Applied to
Go flash blinking lights at printed text right now
by
lukehmiles
1y
ago