AI ALIGNMENT FORUM

Scaling Laws

Edited by riley and plex. Last updated 18th Jun 2023.

Scaling laws refer to the observed trend that the scaling behavior of deep neural networks follows variants of power laws. Here, scaling behavior means how the evaluation metric of interest varies as one varies the amount of compute used for training (or inference), the number of model parameters, the training dataset size, the model input size, or the number of training steps.
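As a rough illustration of what a power-law fit looks like in practice, the sketch below fits the simplest form, y = a * x^(-b), by least squares in log-log space. The data points, the constants `true_a` and `true_b`, and the helper name `fit_power_law` are all made up for this example and are not taken from any paper; real scaling-law fits often use richer functional forms (the "variants" mentioned above) and noisy measurements.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**(-b) via ordinary least squares on (log x, log y).

    Returns (a, b). Hypothetical helper written for this illustration.
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    # Slope of the best-fit line in log-log space.
    cov = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    var = sum((lx - mean_x) ** 2 for lx in log_x)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    # log y = log a - b * log x, so a = exp(intercept) and b = -slope.
    return math.exp(intercept), -slope


# Synthetic, noise-free data (assumed values, for illustration only):
# loss as a function of parameter count.
param_counts = [1e6, 1e7, 1e8, 1e9]
true_a, true_b = 10.0, 0.1
losses = [true_a * n ** (-true_b) for n in param_counts]

a_hat, b_hat = fit_power_law(param_counts, losses)

# Extrapolate the fitted law one order of magnitude past the data.
predicted_loss_at_1e10 = a_hat * 1e10 ** (-b_hat)
print(f"a = {a_hat:.4f}, b = {b_hat:.4f}, "
      f"predicted loss at 1e10 params = {predicted_loss_at_1e10:.4f}")
```

On a log-log plot a pure power law is a straight line, which is why a linear fit on the logged values recovers the exponent. Deviations from that line are what the broken and inverse scaling posts listed under this tag discuss.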

External links

"Broken Neural Scaling Laws" paper

Scaling laws graph from Scaling Laws for Neural Language Models

Posts tagged Scaling Laws

chinchilla's wild implications (97 karma, 3y, 13 comments)

What will GPT-2030 look like? (86 karma, 2y, 2 comments)

Thoughts on the Alignment Implications of Scaling Language Models (29 karma, 4y, 2 comments)

o1: A Technical Primer (65 karma, 11mo, 8 comments)

Paper: On measuring situational awareness in LLMs, by Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg, Maximilian Kaufmann (44 karma, 2y, 13 comments)

Inverse Scaling Prize: Second Round Winners, by Ian McKenzie, Sam Bowman, Ethan Perez (23 karma, 3y, 3 comments)

NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG (19 karma, 4y, 1 comment)

[Link] Training Compute-Optimal Large Language Models (15 karma, 4y, 7 comments)

Inverse scaling can become U-shaped (12 karma, 3y, 13 comments)

The effect of horizon length on scaling laws (13 karma, 3y, 2 comments)

Announcing the Inverse Scaling Prize ($250k Prize Pool), by Ethan Perez, Ian McKenzie, Sam Bowman (59 karma, 3y, 1 comment)

How much chess engine progress is about adapting to bigger computers?, by paulfchristiano (49 karma, 4y, 10 comments)

Scaling Laws for Reward Model Overoptimization, by leogao, John Schulman, Jacob_Hilton (53 karma, 3y, 5 comments)

Compute Trends Across Three eras of Machine Learning, by Jsevillamol, Pablo Villalobos, lennart, Marius Hobbhahn, Tamay Besiroglu, anson.ho (33 karma, 4y, 5 comments)

Causal confusion as an argument against the scaling hypothesis, by RobertKirk, David Scott Krueger (formerly: capybaralet) (33 karma, 3y, 19 comments)
