JoshuaOSHickman — AI Alignment Forum

A Rephrasing Of and Footnote To An Embedded Agency Proposal

I wanted to clarify some comments made in this post, which proposed a way to resolve the issues brought up in the Action Counterfactuals subsection of the Embedded Agency paper. I noticed that my ideal edits would end up rewriting the post almost completely (it wasn't particularly clear), and I...

Mar 9, 2022

Exploring Decision Theories With Counterfactuals and Dynamic Agent Self-Pointers

This is a follow-up to A Possible Resolution To Spurious Counterfactuals, which addressed a technical problem in self-proof. See the original post for the proposed solution to the 5 and 10 problem as described in the Embedded Agency paper. I suggested, at the end of that...

Dec 18, 2021

A Possible Resolution To Spurious Counterfactuals

Spurious counterfactuals (perhaps an easier handle than "the Löbian inference issues described in Section 2.1 here, in the Embedded Agency paper") are important to address because they lead to inference problems. Having a function (or agent) rely on proofs about itself to generate its output immediately introduces Löb's Theorem-related...

Dec 6, 2021