This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
3293
METR (org) — AI Alignment Forum
Wikitags
METR (org)
Edited by
Ruby
last updated
1st Jul 2024
Formerly ARC Evals
Subscribe
Discussion
Subscribe
Discussion
Posts tagged
METR (org)
Most Relevant
77
METR: Measuring AI Ability to Complete Long Tasks
Zach Stein-Perlman
7mo
18
70
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes
2y
4
53
METR's Evaluation of GPT-5
GradientDissenter
3mo
0
61
Clarifying METR's Auditing Role
Beth Barnes
1y
0
27
Interpreting the METR Time Horizons Post
snewman
6mo
0
31
CoT May Be Highly Informative Despite “Unfaithfulness” [METR]
GradientDissenter
3mo
0