AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization
About
The paradigm of automated program generation is shifting from one-shot generation to inference-time search, where Large Language Models (LLMs) function as semantic mutation operators within evolutionary loops. While effective, these systems are currently governed by static schedules that fail to account for the non-stationary dynamics of the search process. This rigidity wastes substantial computation: resources are allocated indiscriminately to stagnating populations while promising frontiers remain under-exploited. We introduce AdaEvolve, a framework that reformulates LLM-driven evolution as a hierarchical adaptive optimization problem. AdaEvolve uses an "accumulated improvement signal" to unify decisions across three levels: Local Adaptation, which dynamically modulates the exploration intensity within a population of solution candidates; Global Adaptation, which routes the global resource budget across different solution candidate populations via bandit-based scheduling; and Meta-Guidance, which generates novel solution tactics from previously generated solutions and their corresponding improvements when progress stalls. We demonstrate that AdaEvolve consistently outperforms open-source baselines across 185 open-ended optimization problems, including combinatorial, systems optimization, and algorithm design problems.
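The three levels described above can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not define the accumulated improvement signal or the bandit rule, so the sketch assumes an exponential moving average of score gains as the signal and a UCB1-style rule for budget routing. The names `Island`, `exploration_rate`, `pick_island`, and `stalled`, and all constants, are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Island:
    """One population of candidates (hypothetical structure, not from the paper)."""
    best: float = float("-inf")
    signal: float = 0.0  # assumed form of the signal: EMA of positive score gains
    pulls: int = 0       # number of evaluations spent on this island

    def update(self, score: float, decay: float = 0.9) -> None:
        # Gain is the improvement over the best-so-far; the first sample has no gain.
        gain = max(0.0, score - self.best) if self.pulls else 0.0
        self.best = max(self.best, score)
        self.signal = decay * self.signal + (1.0 - decay) * gain
        self.pulls += 1

    def exploration_rate(self, lo: float = 0.1, hi: float = 0.9) -> float:
        # Local adaptation (assumed rule): explore more as recent gains shrink.
        return lo + (hi - lo) / (1.0 + 10.0 * self.signal)


def pick_island(islands: list[Island], c: float = 1.0) -> int:
    """Global adaptation (assumed UCB1-style rule): index of the island to fund next."""
    for idx, isl in enumerate(islands):
        if isl.pulls == 0:
            return idx  # try every island once before comparing them
    total = sum(isl.pulls for isl in islands)

    def ucb(isl: Island) -> float:
        return isl.signal + c * math.sqrt(math.log(total) / isl.pulls)

    return max(range(len(islands)), key=lambda k: ucb(islands[k]))


def stalled(islands: list[Island], eps: float = 1e-3) -> bool:
    """Meta-guidance trigger (assumed): every island's signal has decayed below eps."""
    return all(isl.pulls > 0 and isl.signal < eps for isl in islands)
```

In a full loop, `pick_island` would choose which population receives the next LLM mutation call, `exploration_rate` would set how aggressively that call departs from existing candidates, and `stalled` would decide when to ask the LLM for new solution tactics instead of another mutation.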
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Mathematical Optimization | Circle Packing Rect n=21 | Best Objective Value | 2.361 | 10 |
| Mathematical Optimization | Heilbronn Convex n=13 | Best Result | 0.029 | 10 |
| Mathematical Optimization | Heilbronn Triangles n=11 | Best Result | 0.036 | 10 |
| Mathematical Optimization | Circle Packing Square n=26 | Best Result | 2.636 | 10 |
| Mathematical Optimization | MinMax Distance n=3 | Best Result | 0.2404 | 10 |
| Systems Optimization | ADRS Prism | Best Score | 26.37 | 9 |
| Systems Optimization | ADRS LLM-SQL | Best Score | 0.775 | 9 |
| Systems Optimization | ADRS NS3 | Best Score | 131.8 | 9 |
| Transaction scheduling | ADRS TXN | Best | 4.35e+3 | 9 |
| Multi-cloud data storage cost minimization | ADRS Cloudcast | Best Cost | 637.1 | 9 |