AI ALIGNMENT FORUMTags
AF

Tripwire

EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
Tripwire
Random Tag
Contributors
1plex

In AI safety, a tripwire is a mechanism designed to detect signs of misalignment in an advanced artificial intelligence and shut it down automatically.

Posts tagged Tripwire
Most Relevant
0
7Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jaime Sevilla, Pablo Antonio Moreno Casares
3y
3
0
1Corrigibility thoughts I: caring about multiple things
Stuart Armstrong
6y
0
Add Posts