Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection

About

Scaling laws are used to plan multi-million-dollar training runs, but fitting those laws can itself cost millions. In modern large-scale workflows, assembling a sufficiently informative set of pilot experiments is already a major budget-allocation problem rather than a routine preprocessing step. We formulate scaling-law fitting as budget-aware sequential experimental design: given a finite pool of runnable experiments with heterogeneous costs, choose which runs to execute so as to maximize extrapolation accuracy in a high-cost target region. We then propose an uncertainty-aware method for sequentially allocating experimental budget toward the runs most useful for target-region extrapolation. Across a diverse benchmark of scaling-law tasks, our method consistently outperforms classical design-based baselines, and often approaches the performance of fitting on the full experimental set while using only about 10% of the total training budget. Our code is available at https://github.com/PlanarG/active-sl.

Sijie Li, Shanda Li, Haowei Lin, Weiwei Sun, Ameet Talwalkar, Yiming Yang• 2026

Related benchmarks

Task	Dataset	Result
Data Science	Scaling Law Discovery lr&bsz	R2 Score (Target Region)0.22	19
Target-region R2 prediction	vocab scaling law instances	Target-Region R20.98	19
Target-region R2 prediction	moe scaling law instances	Target-region R20.83	19
Target-region R2 prediction	data con scaling law instances	Target-region R20.86	19
Target-region R2 prediction	Sparsity Scaling Law Instances	Target-Region R20.53	19
Target-region R2 prediction	farseer scaling law instances	Target-region R20.93	19
Target-region R2 prediction	domain scaling law instances	Target-region R20.95	19
Target-region R2 prediction	parallel scaling law instances	Target-region R20.99	19

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord