AI ALIGNMENT FORUM

AI Misuse

Written by Raymond Arnold last updated 1st May 2023

AI misuse refers to humans using AI in ways that harm humanity.

Posts tagged AI Misuse
Managing catastrophic misuse without robust AIs (Ryan Greenblatt, Buck Shlegeris; 1y; 31 karma; 4 comments)
Adversarial Robustness Could Help Prevent Catastrophic Misuse (ao; 2y; 14 karma; 15 comments)
Covert Malicious Finetuning (Tony Wang, dannyhalawi; 1y; 53 karma; 3 comments)
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation (Soroush Pour, rusheb, Quentin Feuillade--Montixi, Arush Tagade, Stephen Casper; 2y; 11 karma; 1 comment)