Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Distribution-Aware Algorithm Design with LLM Agents

About

We study learning when the learned object is executable solver code rather than a predictor. In this setting, correctness is not enough: two solvers may both return valid solutions on the deployment distribution while differing substantially in runtime. Given samples from an unknown task distribution, the learner returns code evaluated on fresh instances by both solution quality and execution time. Our central abstraction is a \emph{solver hint}: reusable structure inferred from samples and compiled into specialized solver code. We prove that the empirically fastest sample-consistent solver from a fixed library generalizes in both correctness and runtime, and that statistically identifiable hints can be recovered and compiled from polynomially many samples. Empirically, we instantiate the framework with LLM code agents on \(21\) structured combinatorial-optimization target distributions across seven problem classes. The synthesized solvers reach mean normalized quality \(0.971\), improve by \(+0.224\) over the average heuristic pool and by \(+0.098\) over the highest-quality heuristic, and are \(336.9\times\), \(342.8\times\), and \(16.1\times\) faster than the quality-best heuristic, Gurobi, and the selected time-limited exact backend, respectively. On released PACE 2025 Dominating Set private instances, the synthesized solver is valid on all \(100\) graphs and runs about two orders of magnitude faster than top competition solvers, with a moderate quality gap. Inspection shows that many gains come from changing the computational scale: replacing ambient exponential search or general-purpose optimization with compiled distribution-specific computation.

Saharsh Koganti, Priyadarsi Mishra, Pierfrancesco Beneventano, Tomer Galanti• 2026

Related benchmarks

TaskDatasetResultRank
Dominating SetPACE Dominating Set 2025 (private instances)
Validity Score100
5
Dominating SetPACE 2025
Validity Score100
5
Combinatorial Optimization21 target distributions All (test)
Q_LLM0.971
1
Graph Coloring21 target distributions Coloring (test)
Quality Score (LLM)86.8
1
Maximum Independent Set21 target distributions MIS (test)
Q Score (LLM)99.2
1
Maximum Satisfiability21 target distributions MAXSAT (test)
Q_LLM1
1
Minimum Dominating SetMDS 21 target distributions (test)
Q_LLM97.3
1
Multi-dimensional Knapsack ProblemMDKP 21 target distributions (test)
Q (LLM)0.973
1
Packing Linear Programming21 target distributions Packing LP (test)
Q_LLM0.994
1
Traveling Salesperson ProblemTSP 21 target distributions (test)
Q_LLM Score0.993
1
Showing 10 of 10 rows

Other info

Follow for update