Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
About
Handcrafting heuristics for solving complex optimization tasks (e.g., route planning and task allocation) is a common practice but requires extensive domain knowledge. Recently, Large Language Model (LLM)-based automatic heuristic design (AHD) methods have shown promise in generating high-quality heuristics without manual interventions. Existing LLM-based AHD methods employ a population to maintain a fixed number of top-performing LLM-generated heuristics and introduce evolutionary computation (EC) to iteratively enhance the population. However, these population-based procedures cannot fully develop the potential of each heuristic and are prone to converge into local optima. To more comprehensively explore the space of heuristics, this paper proposes to use Monte Carlo Tree Search (MCTS) for LLM-based heuristic evolution. The proposed MCTS-AHD method organizes all LLM-generated heuristics in a tree structure and can better develop the potential of temporarily underperforming heuristics. In experiments, MCTS-AHD delivers significantly higher-quality heuristics on various complex tasks. Our code is available.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Online Bin Packing | Weibull distribution | Gap (%)0.25 | 63 | |
| Traveling Salesman Problem | TSP50 | Optimality Gap2.13 | 58 | |
| Traveling Salesman Problem | TSP-100 | -- | 53 | |
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value15.683 | 50 | |
| Automated Heuristic Discovery | AHD Individual (Instance-wise) | Average Tardiness4.01e+3 | 28 | |
| Capacitated Vehicle Routing Problem | CVRP N=20 10,000 instances (test) | Objective Value4.921 | 26 | |
| Traveling Salesman Problem | TSP N=200 | Cost Gap0.49 | 24 | |
| Online Bin Packing Problem | BPP online N=1k, W=100 | Optimality Gap3.19 | 23 | |
| Online Bin Packing Problem | BPP online N=5k, W=100 | Optimality Gap2.09 | 23 | |
| Multidimensional Knapsack Problem | MKP | Objective Value103.1 | 21 |