Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution
About
Large Language Model (LLM)-based optimization has recently shown promise for autonomous problem solving, yet most approaches still cast LLMs as passive constraint checkers rather than proactive strategy designers, limiting their effectiveness on complex Constraint Optimization Problems (COPs). To address this, we present AutoCO, an end-to-end Automated Constraint Optimization method that tightly couples operations-research principles of constraint relaxation with LLM reasoning. A core innovation is a unified triple-representation that binds relaxation strategies, algorithmic principles, and executable codes. This design enables the LLM to synthesize, justify, and instantiate relaxation strategies that are both principled and executable. To navigate fragmented solution spaces, AutoCO employs a bidirectional global-local coevolution mechanism, synergistically coupling Monte Carlo Tree Search (MCTS) for global relaxation-trajectory exploration with Evolutionary Algorithms (EAs) for local solution intensification. This continuous exchange of priors and feedback explicitly balances diversification and intensification, thus preventing premature convergence. Extensive experiments on three challenging COP benchmarks validate AutoCO's consistent effectiveness and superior performance, especially in hard regimes where current methods degrade. Results highlight AutoCO as a principled and effective path toward proactive, verifiable LLM-driven optimization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Constrained Optimization | VRPTW Large | Optimality Gap42 | 11 | |
| Constrained Optimization | VRPTW-fuel Small (S) | Optimality Gap0.31 | 11 | |
| Constrained Optimization | SFL (4) | Optimality Gap2 | 11 | |
| Constrained Optimization | SFL (5(Dual)) | Optimality Gap15 | 11 | |
| Constrained Optimization | VRPTW Medium | Optimality Gap45 | 11 | |
| Constrained Optimization | SFL (8) | Optimality Gap17 | 11 | |
| Constrained Optimization | VRPTW Small (S) | Optimality Gap53 | 11 | |
| Constrained Optimization | VRPTW-fuel Medium | Optimality Gap0.00e+0 | 10 | |
| Constrained Optimization | VRPTW-fuel Large (L) | Optimality Gap0.00e+0 | 6 | |
| Vehicle Routing Problem with Time Windows | VRPTW L | Te2e Time108.9 | 4 |