AI ALIGNMENT FORUM
Moderation Log
Deleted Comments
Comment Author | Post | Deleted By User | Deleted Date | Deleted Public | Reason
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
Nicholas Goldowsky-Dill | Impact stories for model internals: an exercise for interpretability researchers | Nicholas Goldowsky-Dill | 12d | false | feedback on draft
rosehadshar | Paradigms and Theory Choice in AI: Adaptivity, Economy and Control | RobertM | 1mo | false | removing suggestion from published comments
rosehadshar | Paradigms and Theory Choice in AI: Adaptivity, Economy and Control | RobertM | 1mo | false | removing suggestion from published comments
rosehadshar | Paradigms and Theory Choice in AI: Adaptivity, Economy and Control | RobertM | 1mo | false | removing suggestion from published comments
Ryan Greenblatt | Against Almost Every Theory of Impact of Interpretability | ryan_greenblatt | 1mo | true | Moved below other comment
Users Banned From Posts
Author | Post | Banned Users
michaelcohen | Asymptotically Unambitious AGI | GPT2
Users Banned From Users
ID
Banned From Frontpage
Banned From Personal Posts
Noosphere89
Phil Tanny
Eliezer Yudkowsky
shminux
rank-biserial
Ruben Bloom
mike_hawke
Viliam
PatrickDFarley
Stuart Anderson
Ericf
Liav Koren
[DEACTIVATED] Duncan Sabien
frontier64
Kaj Sotala
Thomas Kwa
Zack M. Davis
Said Achmiz
lsusr
Christian Kleineidam
shminux
Jason Maguire
Christian Kleineidam
shminux
Jason Maguire
DirectedEvolution
Christian Kleineidam
Said Achmiz
TAG
Christian Kleineidam
Said Achmiz
TAG
TurnTrout
[anonymous]
Ofer
Optimization Process
chinese5
Pee Doom
Said Achmiz
[DEACTIVATED] Duncan Sabien
Moderated Users
Rate Limited Users
User | Ended at | Type
kcrosley-leisurelabs | 3mo | allComments
Iknownothing | 5mo | allPosts
Tensor White | 1mo | allComments
Gerald Monroe | 1mo | allPosts
Gerald Monroe | 1mo | allComments
monkymind | 8mo | allPosts
monkymind | 8mo | allComments
Rejected Posts
Rejected Comments