Forecasting progress in language models
by Matthew Barnett and Metaculus
Note: this post was cross-posted to Metaculus over a week ago as part of their new Metaculus journal. Here, I describe a way of measuring the performance of language models, and extrapolate this measure using publicly available data on benchmarks. The result is a (surprisingly) short timeline to "human-level"—within one...
Oct 28, 202162