Empirical laws governing how language model performance scales with compute, data, and parameters