This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Wikitags
METR (org)
Edited by
Ruby
last updated
1st Jul 2024
Formerly ARC Evals
Subscribe
Subscribe
Discussion
0
Discussion
0
Posts tagged
METR (org)
Most Relevant
77
METR: Measuring AI Ability to Complete Long Tasks
Zach Stein-Perlman
5mo
18
70
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes
2y
4
53
METR's Evaluation of GPT-5
GradientDissenter
1mo
0
61
Clarifying METR's Auditing Role
Beth Barnes
1y
0
25
Interpreting the METR Time Horizons Post
snewman
5mo
0
31
CoT May Be Highly Informative Despite “Unfaithfulness” [METR]
GradientDissenter
1mo
0