AI ALIGNMENT FORUM
AI Benchmarking
This page is a stub.
Posts tagged AI Benchmarking
1
16
Improving Model-Written Evals for AI Safety Benchmarking
Sunishchal Dev, Marius Hobbhahn
5mo
0
0
5
Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents
Sam Brown, Basil Labib, Codruta Lugoj, Sai Sasank Y
8mo
0
0
5
MMLU’s Moral Scenarios Benchmark Doesn’t Measure What You Think it Measures
corey morris
1y
2