Running out of high-quality text to pretrain on. Model size growth outpaces global production of novel text. Villalobos et al. (2024) estimates this happens within the current decade.
The paper is unusually optimistic here. Several countervailing forces:
Naive iterated training on self-generated data → model collapse (Shumailov et al. 2024).
But: forms of test-time scaling that improve base-model generations and iteratively distill those improvements back work (this is the AlphaZero pattern again — see 08 - Pathway 3 — Recursive Self-Improvement). The key is having a quality filter or verifier — like win/loss in chess.
"When is third-party experience sufficient in practice for learning to plan and act, without fuelling self-delusions?"
This is Ortega et al. 2021's result: training on observational data of other agents acting can be causally insufficient for learning to make decisions yourself. You can imitate without understanding cause.
This bears directly on whether you can train AGI by watching humans — vs. needing the AI to act in the world.
"If the progress from AGI to ASI is mainly driven by scaling compute and models, then scaling up data generation, simulation, and collection at a similar pace through more compute may be possible, leading to data availability being a friction but not a fundamental blocker."
Friction, not wall. Probably.