RoCo: Role-Based LLMs Collaboration for Automatic Heuristic Design

About

Automatic Heuristic Design (AHD) has gained traction as a promising solution for solving combinatorial optimization problems (COPs). Large Language Models (LLMs) have emerged and become a promising approach to achieving AHD, but current LLM-based AHD research often only considers a single role. This paper proposes RoCo, a novel Multi-Agent Role-Based System, to enhance the diversity and quality of AHD through multi-role collaboration. RoCo coordinates four specialized LLM-guided agents-explorer, exploiter, critic, and integrator-to collaboratively generate high-quality heuristics. The explorer promotes long-term potential through creative, diversity-driven thinking, while the exploiter focuses on short-term improvements via conservative, efficiency-oriented refinements. The critic evaluates the effectiveness of each evolution step and provides targeted feedback and reflection. The integrator synthesizes proposals from the explorer and exploiter, balancing innovation and exploitation to drive overall progress. These agents interact in a structured multi-round process involving feedback, refinement, and elite mutations guided by both short-term and accumulated long-term reflections. We evaluate RoCo on five different COPs under both white-box and black-box settings. Experimental results demonstrate that RoCo achieves superior performance, consistently generating competitive heuristics that outperform existing methods including ReEvo and HSEvo, both in white-box and black-box scenarios. This role-based collaborative paradigm establishes a new standard for robust and high-performing AHD.

Jiawei Xu, Feng-Feng Wei, Wei-Neng Chen• 2025

Related benchmarks

Task	Dataset	Result
Capacitated Vehicle Routing Problem	CVRP N=100	Objective Value15.706	95
Traveling Salesman Problem	TSP50	Optimality Gap0.018	77
Traveling Salesman Problem	TSP-200	Optimality Gap0.188	46
Traveling Salesman Problem	TSP N=20	Optimality Gap0.00e+0	45
Capacitated Vehicle Routing Problem	CVRP-200	Objective Value27.782	43
Traveling Salesman Problem	TSP100	Optimality Gap (%)0.1	37
Traveling Salesman Problem	TSP N=100	--	32
Multidimensional Knapsack Problem	MKP	Objective Value103.6	27
Traveling Salesman Problem	TSP N=200	--	27
Orienteering Problem	OP	Objective Value57.365	24

Showing 10 of 24 rows

Other info

Follow for update

@wizwand_team Discord