Assuring Agent Safety Evaluations By Analysing Transcripts
by Jerome Wynne and Cozmin Ududec
Summary This is a research update from the Science of Evaluation team at the UK AI Security Institute. In this update, we share preliminary results from analysing transcripts of agent activity that may be of interest to researchers working in the field. AISI generates thousands of transcripts when running its...
Oct 10, 20259