AI ALIGNMENT FORUM
AF

378
davekasten
Ω2000
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
0davekasten's Shortform
1y
0
Buck's Shortform
davekasten1y00

Possibly misguided question given the context -- I see you incorporating imperfect information in "the attack fails silently", why not also a distinction between "the attack succeeds noisily, the AI wins and we know it won" and "the attack succeeds silently, the AI wins and we don't know it won" ? 

Reply
Fields that I reference when thinking about AI takeover prevention
davekasten1y35

So, I really, really am not trying to be snarky here but am worried this comment will come across this way regardless.  I think this is actually quite important as a core factual question given that you've been around this community for a while, and I'm asking you in your capacity as "person who's been around for a minute".  It's non-hyperbolically true that no one has published this sort of list before in this community?  

I'm asking, because if that's the case, someone should, e.g., just write a series of posts that just marches through US government best-practices documents on these domains (e.g., Chemical Safety Board, DoD NISPOM, etc.) and draws out conclusions on AI policy.   

Reply
29A Narrow Path: a plan to deal with AI extinction risk
1y
0