Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection

About

Scaling laws are used to plan multi-million-dollar training runs, but fitting those laws can itself cost millions. In modern large-scale workflows, assembling a sufficiently informative set of pilot experiments is already a major budget-allocation problem rather than a routine preprocessing step. We formulate scaling-law fitting as budget-aware sequential experimental design: given a finite pool of runnable experiments with heterogeneous costs, choose which runs to execute so as to maximize extrapolation accuracy in a high-cost target region. We then propose an uncertainty-aware method for sequentially allocating experimental budget toward the runs most useful for target-region extrapolation. Across a diverse benchmark of scaling-law tasks, our method consistently outperforms classical design-based baselines, and often approaches the performance of fitting on the full experimental set while using only about 10% of the total training budget. Our code is available at https://github.com/PlanarG/active-sl.

Sijie Li, Shanda Li, Haowei Lin, Weiwei Sun, Ameet Talwalkar, Yiming Yang• 2026

Related benchmarks

TaskDatasetResultRank
Data ScienceScaling Law Discovery lr&bsz
R2 Score (Target Region)0.22
19
Target-region R2 predictionvocab scaling law instances
Target-Region R20.98
19
Target-region R2 predictionmoe scaling law instances
Target-region R20.83
19
Target-region R2 predictiondata con scaling law instances
Target-region R20.86
19
Target-region R2 predictionSparsity Scaling Law Instances
Target-Region R20.53
19
Target-region R2 predictionfarseer scaling law instances
Target-region R20.93
19
Target-region R2 predictiondomain scaling law instances
Target-region R20.95
19
Target-region R2 predictionparallel scaling law instances
Target-region R20.99
19
Showing 8 of 8 rows

Other info

Follow for update