You are viewing revision 1.0.0, last edited by plex

In AI safety, a tripwire is a mechanism designed to detect signs of misalignment in an advanced artificial intelligence and shut it down automatically.
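As a purely illustrative toy, the detect-then-shut-down pattern in this definition can be sketched in a few lines of Python. Every name here (`TripwiredSystem`, `TripwireTriggered`, `toy_policy`, `is_forbidden`) is hypothetical and made up for this sketch. Real proposals would need far more robust detection than matching a string, and nothing below is drawn from an actual system.

```python
from dataclasses import dataclass, field
from typing import Callable, List


class TripwireTriggered(Exception):
    """Raised when a proposed action matches a forbidden condition."""


@dataclass
class TripwiredSystem:
    # Hypothetical policy: maps an observation to a proposed action.
    policy: Callable[[str], str]
    # Hypothetical detectors: each returns True when an action looks like
    # a sign of misalignment.
    detectors: List[Callable[[str], bool]]
    halted: bool = False
    log: List[str] = field(default_factory=list)

    def step(self, observation: str) -> str:
        # Once shut down, the system refuses to act again.
        if self.halted:
            raise TripwireTriggered("system is already shut down")
        action = self.policy(observation)
        # Check the proposed action before it is released.
        for detector in self.detectors:
            if detector(action):
                self.halted = True  # automatic shutdown
                self.log.append(f"tripwire fired on: {action}")
                raise TripwireTriggered(action)
        return action


# Toy stand-ins (assumptions for illustration only).
def toy_policy(observation: str) -> str:
    if observation == "unmonitored":
        return "disable_oversight"
    return "answer_question"


def is_forbidden(action: str) -> bool:
    return action == "disable_oversight"
```

In this sketch the check runs before the action is returned, so a detected action is never released, and the `halted` flag makes the shutdown persist across later calls.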