What is Scaling Laws?

Scaling Laws — AI Glossary

Scaling Laws

Definition Empirically discovered relationships between model size, training data, compute budget, and AI model performance that help predict how capabilities improve with scale.

In Depth

Scaling laws describe the predictable mathematical relationships between the resources invested in training AI models (model size, dataset size, compute budget) and their resulting performance. Research by Kaplan et al. at OpenAI (2020) showed that model loss follows power-law relationships with these factors, meaning performance improves smoothly and predictably as any of these resources increase.

These laws enable organizations to forecast model performance before investing in expensive training runs. They show that performance improvements require exponentially more resources: achieving a 10% improvement in capability might require doubling the compute budget. This helps explain why frontier AI development is concentrated among well-resourced organizations and why efficiency improvements (doing more with less compute) are so valuable.

Scaling laws have driven strategic decisions across the AI industry. They influenced the decision to train ever-larger models (GPT-4, Claude, Gemini) and motivated the Chinchilla scaling analysis showing that many models were undertrained relative to their size. They also reveal diminishing returns, spurring interest in complementary approaches: better data quality, improved architectures, inference-time compute (reasoning models), and test-time scaling.

Scaling Laws

In Depth

Browse more terms