AI ALIGNMENT FORUMTags
AF

AI-assisted/AI automated Alignment

EditHistorySubscribe
Discussion (0)
Help improve this page (1 flag)
EditHistorySubscribe
Discussion (0)
Help improve this page (1 flag)
AI-assisted/AI automated Alignment
Random Tag
Contributors
6Ruben Bloom

Not obviously the best name for this tag, but maybe good to explore/rename. Wiki-tags are publicly editable!

Posts tagged AI-assisted/AI automated Alignment
Most Relevant
2
64Cyborgism
Nicholas Kees Dupuis, janus
1mo
20
1
42Beliefs and Disagreements about Automating Alignment Research
Ian McKenzie
7mo
0
1
50Godzilla Strategies
johnswentworth
10mo
17
1
24Cyborg Periods: There will be multiple AI transitions
Jan_Kulveit, rosehadshar
1mo
2
0
15A survey of tool use and workflows in alignment research
Logan Riggs Smith, Jan Hendrik Kirchner, janus, Jacques Thibodeau
1y
3
1
5Model-driven feedback could amplify alignment failures
Aidan O'Gara
2mo
0
1
5Making it harder for an AGI to "trick" us, with STVs
Tor Økland Barstad
9mo
0
1
5Getting from an unaligned AGI to an aligned AGI?
Tor Økland Barstad
9mo
0
1
4AI-assisted list of ten concrete alignment things to do right now
Luke H Miles
7mo
1
1
4Alignment with argument-networks and assessment-predictions
Tor Økland Barstad
4mo
0
1
80Ngo and Yudkowsky on alignment difficulty
Eliezer Yudkowsky, Richard Ngo
1y
53
1
40[Link] Why I’m optimistic about OpenAI’s alignment approach
Jan Leike
4mo
10
1
31Results from a survey on tool use and workflows in alignment research
Jacques Thibodeau, Jan Hendrik Kirchner, janus, Logan Riggs Smith
3mo
0
1
35Human Mimicry Mainly Works When We’re Already Close
johnswentworth
7mo
4
0
29Prize for Alignment Research Tasks
Andreas Stuhlmüller, William Saunders
1y
6
Load More (15/18)
Add Posts