Learning to Solve and Optimize by Evolving Code
About
Combinatorial and optimization problems are fundamental to many industrial AI applications. Solving large-scale real-world instances of such problems typically requires careful problem formalization, specialized solvers, and expert-designed heuristics. Thus, experts need to specify not only what solutions are, but also how they are derived. By introducing the tool CHECKMATE, we show that algorithm generation via code evolution represents a paradigm shift by eliminating the need to formulate the how. CHECKMATE solely relies on the what. Specifically, a formal specification ensures solutions' correctness and enables systematic performance evaluation of the generated programs, while a natural language description guides the evolutionary process. The effectiveness of our method is demonstrated on selected problems from two industrial domains: configuration and scheduling. In all cases, the evolved algorithms consistently outperform state-of-the-art solvers. This underscores the potential of formal methods in guiding code evolution for automatically solving complex real-world problems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hospital-Cabinet-Problem | HCP Moderate | Solving Percentage100 | 6 | |
| Component-Configurator-Problem | CCP Hard split | Solving Percentage96 | 3 | |
| Component-Configurator-Problem | CCP Overall (test) | Solving Percentage89 | 3 | |
| Energy-aware Dual-Flexible Job-Shop Scheduling Problem | E-DFJSP (Easy) | Solving Percentage100 | 3 | |
| Energy-aware Dual-Flexible Job-Shop Scheduling Problem | E-DFJSP Moderate | Solving Percentage100 | 3 | |
| Energy-aware Dual-Flexible Job-Shop Scheduling Problem | E-DFJSP (Hard split) | Solving Percentage100 | 3 | |
| Energy-aware Dual-Flexible Job-Shop Scheduling Problem | E-DFJSP Overall (test) | Solving Percentage100 | 3 | |
| Hospital-Cabinet-Problem | HCP Easy | Solving Percentage100 | 3 | |
| Hospital-Cabinet-Problem | HCP (Hard split) | Solving Percentage100 | 3 | |
| Hospital-Cabinet-Problem | HCP Overall (test) | Solving Percentage100 | 3 |