A Squiggle Maximizer is a hypothetical artificial intelligence whose utility function values something that humans would consider almost worthless, like maximizing the number of paperclip-shaped-molecular-squiggles in the universe. The squiggle maximizer is the canonical thought experiment showing how an artificial general intelligence, even one designed competently and without malice, could ultimately destroy humanity. The thought experiment shows that AIs with apparently innocuous values could pose an existential threat.
The goal of maximizing molecular squiggles is chosen for illustrative purposes because it is very unlikely to be implemented, and has little apparent danger or emotional load (in contrast to, for example, curing cancer or winning wars). This produces a thought experiment which shows the contingency of human values: An extremely powerful optimizer (a highly intelligent agent) could seek goals that are completely alien to ours (orthogonality thesis), and as a side-effect destroy us by consuming resources essential to our survival.
Historical Note: This was originally called a "paperclip maximizer", with paperclips chosen for illustrative purposes because it is very unlikely to be implemented, and has little apparent danger or emotional load (in contrast to, for example, curing cancer or winning wars). Many people interpreted this to be about an AI that was specifically given the instruction of manufacturing paperclips, and that the intended lesson was of an outer alignment failure, i.e., humans failed to give the AI the correct goal. Yudkowsky has since stated that the originally intended lesson was of inner alignment failure, wherein the humans gave the AI some other goal, but the AI's internal processes converged on a goal that seems completely arbitrary from the human perspective.
First discussed in conversations between Yudkowsky and Bostrom (circa 2003), a squiggle maximizer is an artificial general intelligence (AGI) whose goal is to maximize the number of molecular squiggles in its collection. If it has been constructed with a roughly human level of general intelligence, the AGI might collect squiggles, earn money to buy squiggles, or begin to manufacture squiggles.
Any future AGI with full power over the lightcone, if it is not to destroy most potential value from a human perspective, must have something sufficiently close to human values as its terminal value (goal). Further, seemingly small deviations could result in losing most of the value. Human values seem unlikely to spontaneously emerge in a generic optimization process[1]. A dependably safe AI would therefore have to be programmed explicitly with human values or programmed with the ability (including the goal) of inferring human values.

[1] Though it's conceivable that empirical versions of moral realism could hold in practice.