Stop-gradients lead to fixed point predictions
by Johannes Treutlein, Caspar Oesterheld, Rubi J. Hudson, and Emery Cooper
Johannes Treutlein and Rubi Hudson worked on this post as part of SERI MATS, under the mentorship of Evan Hubinger. Rubi has also received mentorship from Leo Gao. We thank Erik Jenner for helpful discussions and Alexander Pan for bringing the performative prediction literature to our attention. Update 30 May...
Jan 28, 2023