AI ALIGNMENT FORUM
AF

Wikitags

Scaling Laws

Edited by riley, plex last updated 18th Jun 2023

Scaling Laws refer to the observed trend that the scaling behaviors of deep neural networks (i.e. how the evaluation metric of interest varies as one varies the amount of compute used for training (or inference), number of model parameters, training dataset size, model input size, or number of training steps) follows variants of power laws. 

External links

  • "Broken Neural Scaling Laws" paper
Scaling laws graph from Scaling Laws for Neural Language Models
Subscribe
1
Subscribe
1
Discussion0
Discussion0
Posts tagged Scaling Laws
97chinchilla's wild implications
nostalgebraist
3y
13
86What will GPT-2030 look like?
jsteinhardt
2y
2
29Thoughts on the Alignment Implications of Scaling Language Models
leogao
4y
2
65o1: A Technical Primer
Jesse Hoogland
9mo
8
44Paper: On measuring situational awareness in LLMs
Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg, Maximilian Kaufmann
2y
13
23Inverse Scaling Prize: Second Round Winners
Ian McKenzie, Sam Bowman, Ethan Perez
3y
3
19NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG
Ozyrus
4y
1
15[Link] Training Compute-Optimal Large Language Models
nostalgebraist
3y
7
12Inverse scaling can become U-shaped
Edouard Harris
3y
13
13The effect of horizon length on scaling laws
Jacob_Hilton
3y
2
59Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez, Ian McKenzie, Sam Bowman
3y
1
49How much chess engine progress is about adapting to bigger computers?
paulfchristiano
4y
10
53Scaling Laws for Reward Model Overoptimization
leogao, John Schulman, Jacob_Hilton
3y
5
33Compute Trends Across Three eras of Machine Learning
Jsevillamol, Pablo Villalobos, lennart, Marius Hobbhahn, Tamay Besiroglu, anson.ho
4y
5
33Causal confusion as an argument against the scaling hypothesis
RobertKirk, David Scott Krueger (formerly: capybaralet)
3y
19
Load More (15/26)
Add Posts