In this paper the authors explore the scaling properties of mixed-modal generative models, discovering new scaling laws that unify the contributions of individual modalities and the interactions between them. What I find most interesting is the so-called competition barrier: when training on multiple modalities, the modalities at first compete and joint training hurts, but past a certain number of parameters/data the loss becomes smaller than if the modalities were trained independently. This seems to predict the cross-modal transfer that was sought after but not found (yet) with GATO.
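To make the barrier concrete, here is a toy sketch in Python. All constants are invented by me, not the paper's fitted values: I model the interaction between two modalities as a competition term that decays with parameter count plus a small constant synergy term, so the barrier is just the model size where the net interaction flips sign and joint training starts beating independent training.

```python
# Toy illustration of a "competition barrier". All constants below are
# made up for illustration; the paper fits real ones per modality pair.
E, A, alpha = 1.8, 520.0, 0.31       # hypothetical unimodal law: L(N) = E + A / N**alpha
comp, syn, gamma = 45.0, 0.05, 0.28  # hypothetical competition / synergy terms

def loss_unimodal(n):
    """Loss when a modality is trained on its own."""
    return E + A / n**alpha

def loss_joint(n):
    """Joint loss: unimodal loss plus an interaction term whose
    competition part (comp / n**gamma) fades with scale while the
    synergy part (syn) does not."""
    return loss_unimodal(n) + comp / n**gamma - syn

# The barrier is the model size where the interaction term changes sign.
n_star = (comp / syn) ** (1 / gamma)
print(f"barrier at roughly {n_star:.2e} parameters")

for n in (1e6, n_star, 1e12):
    gap = loss_joint(n) - loss_unimodal(n)
    print(f"N = {n:.1e}: joint minus independent = {gap:+.4f}")
```

Whether real modality pairs cross, and where, depends entirely on the fitted constants; the point of the toy is just that a decaying competition term plus a persistent synergy term produces exactly the barrier behavior the paper reports.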

Your link seems broken.

It is fixed now, thanks!