AI ALIGNMENT FORUM
AF

Wikitags

Consistent Glomarization

Edited by Morphism last updated 5th Dec 2024

Glomarization is responding to a question with "I can neither confirm nor deny" or something else similarly ambiguous. From Consistent Glomarization Should be Feasible:

It has to be done consistently, to avoid problems like:

LAWYER: Did you ever sleep with him in New York?

WITNESS: I refuse to answer that question.

LAWYER: Did you ever sleep with him in Chicago?

WITNESS: I refuse to answer that question.

LAWYER: Did you ever sleep with him in Miami?

WITNESS: No

Consistent glomarization is the policy of glomarizing when there is a sufficiently high probability measure, from the epistemic perspective of the person asking you the question, on counterfactual selves who would not want to answer honestly. When done well, this can allow you to conceal information while maintaining a code of total honesty.

Subscribe
1
Subscribe
1
Discussion0
Discussion0
Posts tagged Consistent Glomarization
39Counterfactual Mugging Poker Game
Scott Garrabrant
7y
0
Add Posts