Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
About
Handcrafting heuristics for solving complex optimization tasks (e.g., route planning and task allocation) is a common practice but requires extensive domain knowledge. Recently, Large Language Model (LLM)-based automatic heuristic design (AHD) methods have shown promise in generating high-quality heuristics without manual interventions. Existing LLM-based AHD methods employ a population to maintain a fixed number of top-performing LLM-generated heuristics and introduce evolutionary computation (EC) to iteratively enhance the population. However, these population-based procedures cannot fully develop the potential of each heuristic and are prone to converge into local optima. To more comprehensively explore the space of heuristics, this paper proposes to use Monte Carlo Tree Search (MCTS) for LLM-based heuristic evolution. The proposed MCTS-AHD method organizes all LLM-generated heuristics in a tree structure and can better develop the potential of temporarily underperforming heuristics. In experiments, MCTS-AHD delivers significantly higher-quality heuristics on various complex tasks. Our code is available.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Online Bin Packing | Online BPP (test) | Gap (%)0.43 | 120 | |
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value15.58 | 73 | |
| Traveling Salesman Problem | TSP50 | Optimality Gap2.13 | 64 | |
| Online Bin Packing | Weibull distribution | Gap (%)0.25 | 63 | |
| Traveling Salesman Problem | TSP-100 | -- | 56 | |
| Traveling Salesperson Problem | TSP-100 | -- | 42 | |
| Traveling Salesperson Problem | TSP N=200 (Generalization (128 instances)) | Optimality Gap0.204 | 35 | |
| Maximum Independent Set | SATLIB | MIS Size425.9 | 35 | |
| Online Bin Packing Problem | BPP online N=1k, W=100 | Optimality Gap3.19 | 35 | |
| Traveling Salesperson Problem | TSP N=100 (test) | Optimality Gap0.5 | 33 |