User Profile

Ω111178175

Recent Posts

Curated Posts
Curated - Recent, high quality posts selected by the LessWrong moderation team.
Frontpage Posts
Posts meeting our frontpage guidelines: aim to explain, not to persuade. Avoid meta-discussion
(includes curated content and frontpage posts)
All Posts
Includes personal and meta blogposts (as well as curated and frontpage).

Assuming we've solved X, could we do Y...

1020h2 min readShow Highlight
2

Figuring out what Alice wants: non-human Alice

419h1 min readShow Highlight
7

Why we need a *theory* of human values

137d4 min readShow Highlight
0

Humans can be assigned any values whatsoever…

111mo3 min readShow Highlight
0

Disagreement with Paul: alignment induction

43mo1 min readShow Highlight
4

Using expected utility for Good(hart)

64mo6 min readShow Highlight
1

Standard ML Oracles vs Counterfactual ones

52mo5 min readShow Highlight
0

Bridging syntax and semantics with Quine's Gavagai

53mo1 min readShow Highlight
1

Web of connotations: Bleggs, Rubes, thermostats and beliefs

53mo7 min readShow Highlight
0
19

Recent Comments