Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection
About
Scaling laws are used to plan multi-million-dollar training runs, but fitting those laws can itself cost millions. In modern large-scale workflows, assembling a sufficiently informative set of pilot experiments is already a major budget-allocation problem rather than a routine preprocessing step. We formulate scaling-law fitting as budget-aware sequential experimental design: given a finite pool of runnable experiments with heterogeneous costs, choose which runs to execute so as to maximize extrapolation accuracy in a high-cost target region. We then propose an uncertainty-aware method for sequentially allocating experimental budget toward the runs most useful for target-region extrapolation. Across a diverse benchmark of scaling-law tasks, our method consistently outperforms classical design-based baselines, and often approaches the performance of fitting on the full experimental set while using only about 10% of the total training budget. Our code is available at https://github.com/PlanarG/active-sl.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Data Science | Scaling Law Discovery lr&bsz | R2 Score (Target Region)0.22 | 19 | |
| Target-region R2 prediction | vocab scaling law instances | Target-Region R20.98 | 19 | |
| Target-region R2 prediction | moe scaling law instances | Target-region R20.83 | 19 | |
| Target-region R2 prediction | data con scaling law instances | Target-region R20.86 | 19 | |
| Target-region R2 prediction | Sparsity Scaling Law Instances | Target-Region R20.53 | 19 | |
| Target-region R2 prediction | farseer scaling law instances | Target-region R20.93 | 19 | |
| Target-region R2 prediction | domain scaling law instances | Target-region R20.95 | 19 | |
| Target-region R2 prediction | parallel scaling law instances | Target-region R20.99 | 19 |