Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design

About

Handcrafting heuristics for solving complex optimization tasks (e.g., route planning and task allocation) is a common practice but requires extensive domain knowledge. Recently, Large Language Model (LLM)-based automatic heuristic design (AHD) methods have shown promise in generating high-quality heuristics without manual interventions. Existing LLM-based AHD methods employ a population to maintain a fixed number of top-performing LLM-generated heuristics and introduce evolutionary computation (EC) to iteratively enhance the population. However, these population-based procedures cannot fully develop the potential of each heuristic and are prone to converge into local optima. To more comprehensively explore the space of heuristics, this paper proposes to use Monte Carlo Tree Search (MCTS) for LLM-based heuristic evolution. The proposed MCTS-AHD method organizes all LLM-generated heuristics in a tree structure and can better develop the potential of temporarily underperforming heuristics. In experiments, MCTS-AHD delivers significantly higher-quality heuristics on various complex tasks. Our code is available.

Zhi Zheng, Zhuoliang Xie, Zhenkun Wang, Bryan Hooi• 2025

Related benchmarks

Task	Dataset	Result
Online Bin Packing	Online BPP (test)	Gap (%)0.43	120
Capacitated Vehicle Routing Problem	CVRP N=100	Objective Value15.58	87
Traveling Salesman Problem	TSP50	Optimality Gap2.13	77
Traveling Salesman Problem	TSP-100	--	69
Online Bin Packing	Weibull distribution	Gap (%)0.25	63
Traveling Salesperson Problem	TSP-100	--	42
Traveling Salesperson Problem	TSP N=500 Generalization (128 instances)	Optimality Gap0.867	41
Traveling Salesman Problem	TSP-200	--	41
Capacitated Vehicle Routing Problem	CVRP N=20 10,000 instances (test)	Objective Value4.921	38
Maximum Independent Set	RB [200-300]	MIS Size20.09	38

Showing 10 of 178 rows

...

Other info

Follow for update

@wizwand_team Discord