This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Chain-of-thought Alignment
Edit
History
Subscribe
Discussion
(0)
Help improve this page (1 flag)
Edit
History
Subscribe
Discussion
(0)
Help improve this page (1 flag)
Chain-of-thought Alignment
Random Tag
Contributors
1
elifland
(Feel free to rename, and write a description)
Posts tagged
Chain-of-thought Alignment
Most
Relevant
1
43
Externalized reasoning oversight: a research direction for language model alignment
tamera
8mo
10
1
12
Paper: Large Language Models Can Self-improve [Linkpost]
Evan R. Murphy
6mo
4
1
28
Steganography in Chain of Thought Reasoning
A Ray
8mo
9
1
19
Imitation Learning from Language Feedback
Jérémy Scheurer
,
Tomek Korbak
,
Ethan Perez
9h
0
1
5
[ASoT] Simulators show us behavioural properties by default
Arun Jose
3mo
0
1
7
Distilled Representations Research Agenda
Hoagy
,
mishajw
5mo
1