Ari Brill. AI safety researcher, currently Research Affiliate at Principles of Intelligence (PIBBSS). Former astrophysicist.
In your usage, are "scheming" and "deceptive alignment" synonyms, or would you distinguish those terms in any way?
In your usage, are "scheming" and "deceptive alignment" synonyms, or would you distinguish those terms in any way?