Scaling Laws — AI Alignment Forum

Scaling Laws

Edited by riley and plex. Last updated 18th Jun 2023.

Scaling laws refer to the observed trend that the scaling behavior of deep neural networks follows variants of power laws. Here, scaling behavior means how the evaluation metric of interest varies as one varies the amount of compute used for training (or inference), the number of model parameters, the training dataset size, the model input size, or the number of training steps.
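As a rough illustration of what "follows a power law" means in practice, the sketch below fits the simplest form, y = a * x^(-b), by ordinary least squares in log-log space, where a power law becomes a straight line. The function name `fit_power_law`, the parameter counts, and the constants are all made up for illustration and are not taken from any paper. Real scaling-law fits often use richer functional forms (for example with an irreducible-loss term, or the broken power laws linked below), so treat this as a minimal sketch under those assumptions.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**(-b) via least squares on (log x, log y).

    Hypothetical helper for illustration; returns the tuple (a, b).
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    # Slope and intercept of the best-fit line in log-log space.
    sxx = sum((lx - mean_x) ** 2 for lx in log_x)
    sxy = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # log y = log a - b * log x, so a = exp(intercept) and b = -slope.
    return math.exp(intercept), -slope


# Synthetic, noise-free data: illustrative values only, not measurements.
param_counts = [1e6, 1e7, 1e8, 1e9]
true_a, true_b = 50.0, 0.1
losses = [true_a * n ** (-true_b) for n in param_counts]

a, b = fit_power_law(param_counts, losses)

# Extrapolate the fitted curve to a model 10x larger than any in the data.
predicted = a * 1e10 ** (-b)
```

On noise-free data the fit recovers the generating constants up to floating-point error. With real measurements the extrapolation step is where most of the uncertainty lies, since the trend may bend or break outside the fitted range.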

External links

"Broken Neural Scaling Laws" paper

Scaling laws graph from "Scaling Laws for Neural Language Models"

Posts tagged Scaling Laws

[97] chinchilla's wild implications
4y · 13 · 4

[86] What will GPT-2030 look like?
3y · 2 · 0

[29] Thoughts on the Alignment Implications of Scaling Language Models
5y · 2 · 1

[65] o1: A Technical Primer
2y · 8 · 1

[45] Paper: On measuring situational awareness in LLMs
Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg, Maximilian Kaufmann
3y · 13 · 1

[23] Inverse Scaling Prize: Second Round Winners
Ian McKenzie, Sam Bowman, Ethan Perez
3y · 3 · 1

[19] NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG
5y · 1 · 1

[15] [Link] Training Compute-Optimal Large Language Models
4y · 7 · 1

[12] Inverse scaling can become U-shaped
4y · 13 · 1

[13] The effect of horizon length on scaling laws
3y · 2 · 1

[59] Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez, Ian McKenzie, Sam Bowman
4y · 1 · 0

[49] How much chess engine progress is about adapting to bigger computers?
paulfchristiano
5y · 10 · 1

[53] Scaling Laws for Reward Model Overoptimization
leogao, John Schulman, Jacob_Hilton
4y · 5 · 1

[33] Compute Trends Across Three eras of Machine Learning
Jsevillamol, Pablo Villalobos, lennart, Marius Hobbhahn, Tamay Besiroglu, anson.ho
4y · 5 · 1

[33] Causal confusion as an argument against the scaling hypothesis
RobertKirk, David Scott Krueger
4y · 19
