AI ALIGNMENT FORUM
AF

Luke Bailey
Ω9100
Message
Dialogue
Subscribe

Stanford PhD Student

Posts

Sorted by New
25Image Hijacks: Adversarial Images can Control Generative Models at Runtime
2y
1
10Tensor Trust: An online game to uncover prompt injection vulnerabilities
2y
0
9Examples of Prompts that Make GPT-4 Output Falsehoods
2y
0

Wikitag Contributions

No wikitag contributions to display.

Comments

Sorted by
Newest
No Comments Found