Artificial Sandwiching: When can we test scalable alignment protocols without humans? — AI Alignment Forum