You might expect NormalCorp's labor force to be roughly in equilibrium, where the marginal gain from spending more on compute equals the marginal gain from spending more on salaries (to get more/better employees).
[...]
However, I'm quite skeptical of this type of consideration making a big difference. The ML industry has already varied the compute input massively, with over 7 OOMs of difference between research compute now (in 2025) and at the time of AlexNet 13 years ago, which invalidates the view that there is some relatively narrow range of inputs in which neither input is bottlenecking. And AI companies effectively can't pay more to get faster or much better employees, so we're not at a particularly privileged point in human AI R&D capabilities.
SlowCorp has 625K H100s per researcher. What do you even do with that much compute if you drop it into this world? Is every researcher just sweeping hyperparameters on the biggest pretraining runs? I'd normally say "scale up pretraining another factor of 100" and then expect that SlowCorp could plausibly outperform NormalCorp, except you've limited them to 1 week and a similar amount of total compute, so they don't even have that option (and in fact they can't even run normal pretraining runs, since those take longer than 1 week to complete).
The quality and amount of labor isn't the primary problem here. The problem is that current practices for AI development are specialized to the current labor:compute ratio and can't just be changed on a dime if you drastically change that ratio. Sure, the compute input has varied massively, over 7 OOMs; but importantly, this did not happen all at once, and the ecosystem adapted to it as it went.
SlowCorp would be in a much better position if it were in a world where AI development had evolved with these kinds of bottlenecks in place all along. Frontier pretraining runs would be massively more parallel and would complete in a day. There would be dramatically more investment in automating hyperparameter sweeps and scaling analyses, rather than depending on human labor for them. The inference-time compute paradigm would have started 1-2 years earlier and would be significantly more mature. How fast would AI progress be in that world if you were SlowCorp? I agree it would still be slower than current AI progress, but it is really hard to guess how much slower, and it's definitely drastically faster than if you just drop a SlowCorp into today's world (where it mostly seems like it would flounder and die immediately).
So we can break down the impacts into two categories:
If you want to pump your intuition for what AutomatedCorp should be capable of, the relevant SlowCorp is the one that only faces the first problem, that is, you want to consider the SlowCorp that evolved in a world with those constraints in place all along, not the SlowCorp thrown into a research ecosystem not designed for the constraints it faces. Personally, once I try to imagine that I just run into a wall of "who even knows what that world looks like" and fail to have my intuition pumped.
This seems mostly right, except that it's often hard to parallelize work and manage large projects, which seems like it slows things down in important ways. And, of course, some things are strongly serial, using time that can't be sped up via more compute or more people. (See: the PM who hires nine women to have a baby in one month.)
Similarly, running 1,000 AI research groups in parallel might get you the same 20 insights 50 times, rather than generating far more insights. And managing and integrating the research, and deciding where to allocate research time, plausibly gets harder at more than a linear rate with more groups.
So overall, the model seems correct, but I think the 10x speedup is more likely than the 20x speedup.
I agree parallelization penalties might bite hard in practice. But it's worth noting that the AIs in the AutomatedCorp hypothetical also run 50x faster and are more capable.
(A strong marginal parallelization penalty exponent of 0.4 would render the 50x increase in parallel workers equivalent to only a 5x improvement in labor speed, substantially smaller than the 50x speed improvement.)
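For concreteness, the arithmetic behind that equivalence (reading the exponent as effective parallel labor $\propto N^{0.4}$):

$$50^{0.4} \approx 4.8 \approx 5\times,$$

i.e. roughly a 5x gain in effective labor from 50x more parallel workers, compared to the full 50x you get from a 50x serial speedup.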
Maybe it would be helpful to start using some toy models of DAGs/tech trees to get an idea of how wide/deep ratios affect the relevant speedups. It sounds like much of this so far is just people having warring intuitions: 'no, the tree is deep and narrow, so slowing down/speeding up workers doesn't have that much effect because of Amdahl's law, and I handwave it at ~1x speed' vs 'no, I think it's wide and there are lots of work-arounds to any slow node if you can pay for the compute to bypass them, and I will handwave it at 5x speed'.
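To make those warring intuitions concrete, here's a minimal sketch of the kind of toy model being suggested. The layered-DAG shape, the per-node costs, and the worker counts are all made-up illustrative parameters, not anything from the post:

```python
import math

def completion_time(depth, width, workers, speed=1.0, node_work=1.0):
    """Time to finish a layered tech-tree DAG: `depth` layers of `width` nodes,
    where every node depends on the whole previous layer. Within a layer, at
    most `workers` nodes run concurrently; each node needs `node_work` units
    of labor from a worker running at `speed`."""
    waves_per_layer = math.ceil(width / workers)  # serial "waves" inside a layer
    return depth * waves_per_layer * node_work / speed

# Deep, narrow tree: extra parallel workers saturate almost immediately.
print(completion_time(depth=100, width=2, workers=1))             # 200.0
print(completion_time(depth=100, width=2, workers=50))            # 100.0 (only 2x faster)
print(completion_time(depth=100, width=2, workers=50, speed=50))  # 2.0   (serial speed still helps)

# Wide, shallow tree: extra parallel workers are nearly as good as serial speed.
print(completion_time(depth=2, width=100, workers=1))             # 200.0
print(completion_time(depth=2, width=100, workers=50))            # 4.0   (50x faster)
```

In the deep/narrow case extra workers stop helping almost immediately (Amdahl's law) while faster workers keep paying off; in the wide/shallow case extra workers are nearly as good as faster ones. The disagreement above is essentially about which regime real AI R&D is closer to.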
This isn't that important, but I think the idea of using an exponential parallelization penalty is common in the economics literature. I specifically used 0.4 as around the harshest penalty I've heard of; I believe this number comes from some studies on software engineering.
I'm currently skeptical that toy models of DAGs/tech trees will add much value over:
(Separately AIs might be notably better at coordinating than humans are which might change things substantially. Toy models of this might be helpful.)
How much should we expect AI progress to speed up after fully automating AI R&D? This post presents an intuition pump for reasoning about the level of acceleration by talking about different hypothetical companies with different labor forces, amounts of serial time, and compute. Essentially, if you'd expect an AI research lab with substantially less serial time and fewer researchers than current labs (but the same cumulative compute) to make substantially less algorithmic progress, you should also expect a research lab with an army of automated researchers running at much higher serial speed to get correspondingly more done. (And if you'd expect the company with less serial time to make similar amounts of progress, the same reasoning would also imply limited acceleration.) We also discuss potential sources of asymmetry which could break this correspondence and implications of this intuition pump.
The intuition pump
Imagine theoretical AI companies with the following properties:
NormalCorp is similar to a future frontier AI company. SlowCorp is like NormalCorp except with 50x less serial time, a 5x smaller workforce, and lacking above-median researchers/engineers.[2] How much less would SlowCorp accomplish than NormalCorp, i.e. what fraction of NormalCorp's time does it take to achieve the amount of algorithmic progress that SlowCorp would get in a week?
SlowCorp has 50x less serial labor, 5x less parallel labor, as well as reduced labor quality. Intuitively, it seems like it should make much less progress than NormalCorp. My guess is that we should expect NormalCorp to achieve SlowCorp's total progress in at most roughly 1/10th of its time.
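As a rough tally of the labor gap implied by those multipliers (ignoring the labor-quality difference):

$$\frac{\text{NormalCorp researcher-time}}{\text{SlowCorp researcher-time}} \approx 50 \times 5 = 250,$$

so this guess amounts to saying that cutting researcher-time by ~250x (while keeping total compute similar) costs a factor of at least roughly 10 in algorithmic progress.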
Now let's consider an additional corporation, AutomatedCorp, which is an analog for a company sped up by AI R&D automation.
AutomatedCorp is like NormalCorp except with 50x more serial time, a 50x larger workforce, and only world-class researchers and engineers. The jump from NormalCorp to AutomatedCorp is like the jump from SlowCorp to NormalCorp, but with a 10x larger increase in the number of employees and a somewhat different structure to the increase in labor quality.
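Spelling the multipliers out: SlowCorp $\to$ NormalCorp is $50\times$ serial and $5\times$ parallel, while NormalCorp $\to$ AutomatedCorp is $50\times$ serial and $50\times$ parallel, so the two jumps share the same serial multiplier and differ by a $50/5 = 10\times$ larger headcount increase (plus the different shape of the labor-quality change).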
It seems like the speedup from NormalCorp to AutomatedCorp should be at least similar to the jump from SlowCorp to NormalCorp, i.e. at least roughly 10x. My best guess is around 20x.
AutomatedCorp is an analogy for a hypothetical AI company with AI researchers that match the best human researchers while having 200k copies that each run 50x faster than humans.[5] If you have the intuition that a downgrade to SlowCorp would be very hobbling while this level of AI R&D automation wouldn't vastly speed up progress, consider how to reconcile those two intuitions.
That's the basic argument. Below I will go over some clarifications, a few reasons the jumps between the corps might be asymmetric, and the implications of high speedups from AutomatedCorp.
Clarifications
There are a few potentially important details which aren't clear in the analogy, written in the context of the jump from NormalCorp to AutomatedCorp:
Asymmetries
Why would the current regime be special in a way that makes scaling up labor (including quality and speed) highly asymmetric with scaling it down?
Here I'll cover asymmetries between the jumps from SlowCorp to NormalCorp and NormalCorp to AutomatedCorp.
There are some reasons you might eventually see asymmetry between improving vs. degrading labor quality, speed, and quantity. In particular, in some extreme limit you might, e.g., just figure out the best experiments to run from an ex-ante perspective after doing all the possibly useful theoretical work. But it's very unclear where we are relative to various absolute limits, and there isn't any particular reason to expect we're very close. One way to think about this is to imagine some aliens which are actually 50x slower than us and whose ML researchers/engineers are only as good as our median AI researchers/engineers (while having a similar absolute amount of compute in terms of FLOP/s). These aliens could consider the exact same hypothetical, but for them, the move from NormalCorp to AutomatedCorp is very similar to our move from SlowCorp to NormalCorp. So, if we're uncertain about whether we are these slow aliens in the hypothetical, we should think the situation is symmetric, and our guesses for the SlowCorp vs. NormalCorp and NormalCorp vs. AutomatedCorp multipliers should be basically the same.
(That is, unless we can do some absolute analysis of our quantity/quality/speed of labor which implies that (e.g.) returns diminish right around now, or some absolute analysis of the relationship between labor and compute. Such an analysis would presumably need to be mechanistic (aka inside view) or utilize actual experiments (like I discuss in one of the items in the list above), because analysis which just looks at reference classes (aka outside view) would apply just as well to the aliens and doesn't take into account the amount of compute we have in practice. I don't know how you'd do this mechanistic analysis reliably, though actual experiments could work.)
Implications
I've now introduced some intuition pumps with AutomatedCorp, NormalCorp, and SlowCorp. Why do I think these intuition pumps are useful? I think the biggest crux about the plausibility of substantially faster AI progress due to AI automation of AI R&D is how much acceleration you'd see in something like the AutomatedCorp scenario (relative to the NormalCorp scenario). This doesn't have to be the crux: you could think the initial acceleration is high, but that progress will very quickly slow because diminishing returns on AI R&D effort bite harder than improved algorithms can compensate via smarter, faster, and cheaper AI researchers which accelerate things further. But I think it is somewhat hard for the returns (and other factors) to look so bad that we won't at least have the equivalent of 3 years of overall AI progress (not just algorithms) within 1 year of seeing AIs matching the description of AutomatedCorp, if we condition on these AIs yielding an AI R&D acceleration multiplier of >20x.[7]
Another potential crux for downstream implications is how big of a deal >4 years of overall AI progress is. Notably, if we see 4-year timelines (e.g. to the level of AIs I've discussed), then 4 years of AI progress is what takes us from the systems we have now (e.g. o3) to full AI R&D automation, so another 4 years of progress feels intuitively very large.[8] Also, if we see higher returns to some period of AI progress (in terms of ability to accelerate AI R&D), then a super-exponential loop where smarter AIs build ever smarter AI systems faster and faster becomes more likely. Overall, shorter timelines tend to imply faster takeoff (at least evidentially; the causal story is much more complex). I think some disagreements about takeoff would be resolved if we conditioned on timelines and on what the run-up to a given level of capability looks like, because the disagreement is really about the returns to a given amount of AI progress.
These employees are the best that NormalCorp could find while hiring aggressively over a few years, plus a smaller core of more experienced researchers and engineers (around 300) who've worked in AI for longer. They have some number of the best employees working in AI (perhaps 1/5 of the best 1000 people on earth), but most of their employees are more like typical tech employees: what NormalCorp could hire in a few years with high salaries and an aim to recruit rapidly. ↩︎
And below-median ones, but that shouldn't have as big of an effect as removing the above-median employees. ↩︎
These employees are the best that NormalCorp could find while hiring aggressively over a few years, plus a smaller core of more experienced researchers and engineers (around 300) who've worked in AI for longer. They have some number of the best employees working in AI (perhaps 1/5 of the best 1000 people on earth), but most of their employees are more like typical tech employees: what NormalCorp could hire in a few years with high salaries and an aim to recruit rapidly. ↩︎
Roughly 1.5-3x smaller than OpenAI's current computational resources. ↩︎
These are basically just the estimates for the number of copies and speed at the point of superhuman AI researchers in AI 2027, but I get similar numbers if I do the estimate myself. Note that (at least for my estimates) the 50x speed includes accounting for AIs working 24/7 (a factor of 3) and for their being better at coordinating and sharing state with weaker models, so they can easily complete some tasks faster. It's plausible that heavy inference-time compute use implies that we'll initially have a smaller number of slower AI researchers, but we should still expect that quantity and speed will quickly increase after this is initially achieved. So, you can think of this scenario as what happens after allowing some time for costs to drop. This scenario occurring a bit after initial automation doesn't massively alter the bottom-line takeaways. (That said, if inference-time compute allows for greatly boosting capabilities, then at the time when we have huge numbers of fast AI researchers matching the best humans, we might also be able to run a smaller number of researchers which are substantially qualitatively superhuman.) ↩︎
Interestingly, this implies that AI runtime compute use is comparable to a human's. Producing a second of cognition from a human takes perhaps 1e14 to 1e15 FLOP, or between 1/10 and 1 H100-seconds (taking an H100 as roughly 1e15 FLOP/s). We're imagining that AI inference takes 1/5 of an H100-second to produce a second of cognition. While inference requirements are similar in this scenario, I'm imagining that training requirements start substantially higher than human lifetime FLOP. (I'm imagining the AI was trained with roughly 1e28 FLOP, while human lifetime FLOP is more like 1e24.) This seems roughly right, as I think we should expect faster inference but bigger training requirements, at least after a bit of adaptation time etc., based on how historical AI progress has gone. But this is not super clear cut. ↩︎
And we condition on reaching this level of capability prior to 2032 so that it is easier to understand the relevant regime, and on the relevant AI company going full steam ahead without external blockers. ↩︎
The picture is a bit messy because I expect AI progress will start slowing due to slowed compute scaling by around 2030 or so (if we don't achieve very impressive AI by that point). This is partially because continued compute scaling will require very extreme quantities of investment by then, and partially because fab capacity will run out as ML chips eat up a larger and larger share of it. In such a regime, I expect a somewhat higher fraction of progress will be algorithmic (rather than coming from scaling compute or finding additional data), though not by that much, as algorithmic progress is itself driven by additional compute rather than additional data. Also, the rate of algorithmic progress will be slower in absolute terms. So, 20x faster algorithmic progress will yield a higher overall progress multiplier, but progress will also be generally slower: you'll maybe get a lower number of 2024-equivalent years of progress, but a higher number of 2031-equivalent years of progress. ↩︎