AI Alignment Forum > Tags

Scaling Laws

Contributors: plex

Scaling Laws refer to the observed trend in some machine learning architectures (notably transformers) for performance to improve as a predictable power law when given more compute, data, or parameters (model size), assuming training is not bottlenecked on one of the other resources. This trend has been observed to hold consistently over more than six orders of magnitude.

Scaling laws graph from Scaling Laws for Neural Language Models
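The power-law relationship described above can be sketched numerically: a power law is a straight line in log-log space, so its exponent can be recovered by a linear fit. The constants below are illustrative assumptions, not the fitted values from the paper.

```python
import numpy as np

# Hypothetical scaling-law constants (illustrative only, not the paper's fit):
# loss L(C) = a * C**(-b) for compute C, assuming no data/parameter bottleneck.
a, b = 2.5, 0.05

# Synthetic losses over six orders of magnitude of compute.
compute = np.logspace(3, 9, num=7)
loss = a * compute ** (-b)

# A power law is a straight line in log-log space; a linear fit on
# (log C, log L) recovers the exponent as the (negated) slope.
slope, intercept = np.polyfit(np.log(compute), np.log(loss), deg=1)
print(-slope)             # recovers the exponent b
print(np.exp(intercept))  # recovers the coefficient a
```

This is why scaling-law plots are conventionally drawn on log-log axes: deviations from a straight line indicate a bottleneck in one of the other resources.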
Posts tagged Scaling Laws (Most Relevant)

- 29 · Thoughts on the Alignment Implications of Scaling Language Models (leogao, 1y)
- 19 · NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG (Ozyrus, 7mo)
- 15 · [Link] Training Compute-Optimal Large Language Models (nostalgebraist, 2mo)
- 48 · How much chess engine progress is about adapting to bigger computers? (Paul Christiano, 10mo)
- 32 · Compute Trends Across Three eras of Machine Learning (Jaime Sevilla, Pablo Villalobos, Lennart Heim, Marius Hobbhahn, Tamay Besiroglu, Anson Ho, 3mo)
- 15 · Parameter counts in Machine Learning (Jaime Sevilla, Pablo Villalobos, 1y)
- 15 · Estimating training compute of Deep Learning models (Lennart Heim, Jaime Sevilla, Marius Hobbhahn, Tamay Besiroglu, Anson Ho, 4mo)
- 8 · How to measure FLOP/s for Neural Networks empirically? (Marius Hobbhahn, 6mo)