Assuring Agent Safety Evaluations By Analysing Transcripts
Summary This is a research update from the Science of Evaluation team at the UK AI Security Institute. In this update, we share preliminary results from analysing transcripts of agent activity that may be of interest to researchers working in the field. AISI generates thousands of transcripts when running its...
Oct 10, 20257