
PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs

About

Large Language Models (LLMs) have enabled automated heuristic design (AHD) for combinatorial optimization problems (COPs), but existing frameworks' reliance on fixed evolutionary rules and static prompt templates often leads to myopic heuristic generation, redundant evaluations, and limited reasoning about how new heuristics should be derived. We propose a novel multi-agent reasoning framework, referred to as Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs (PathWise), which formulates heuristic generation as a sequential decision process over an entailment graph serving as a compact, stateful memory of the search trajectory. This approach allows the system to carry forward past decisions and reuse or avoid derivation information across generations. A policy agent plans evolutionary actions, a world model agent generates heuristic rollouts conditioned on those actions, and critic agents provide routed reflections summarizing lessons from prior steps, shifting LLM-based AHD from trial-and-error evolution toward state-aware planning through reasoning. Experiments across diverse COPs show that PathWise converges faster to better heuristics, generalizes across different LLM backbones, and scales to larger problem sizes.
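The loop described above (a policy agent plans an action, a world model agent generates a heuristic conditioned on it, a critic reflects, and the result is recorded in a graph that later decisions consult) can be illustrated with a minimal sketch. This is not the authors' implementation. Every name here (`Node`, `EntailmentGraph`, `policy_agent`, `world_model_agent`, `critic_agent`, `search`) is hypothetical, the LLM calls are replaced by deterministic stubs, and the "heuristic" is reduced to a single number scored by an assumed toy objective. The only point is to show how stored derivation records let the planner avoid repeating an action it has already tried.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One heuristic plus the record of how it was derived."""
    node_id: int
    heuristic: float      # stand-in for heuristic code: a single tunable parameter
    score: float
    parents: tuple = ()   # ids of the heuristics it was derived from
    action: str = "init"  # evolutionary action that produced it
    reflection: str = ""  # critic's note on this derivation step


@dataclass
class EntailmentGraph:
    """Hypothetical stateful memory of the search trajectory."""
    nodes: list = field(default_factory=list)

    def add(self, heuristic, score, parents=(), action="init", reflection=""):
        node = Node(len(self.nodes), heuristic, score, tuple(parents), action, reflection)
        self.nodes.append(node)
        return node

    def ranked(self):
        return sorted(self.nodes, key=lambda n: n.score, reverse=True)

    def best(self):
        return self.ranked()[0]


def evaluate(heuristic):
    """Assumed toy objective: higher is better, optimum at 0.7."""
    return -(heuristic - 0.7) ** 2


def policy_agent(graph):
    """Stub planner: pick the first candidate derivation not already in the graph."""
    ranked = graph.ranked()
    tried = {(n.action, n.parents) for n in graph.nodes}
    candidates = [("refine", (ranked[0].node_id,))]
    if len(ranked) > 1:
        pair = tuple(sorted((ranked[0].node_id, ranked[1].node_id)))
        candidates.append(("crossover", pair))
        candidates.append(("refine", (ranked[1].node_id,)))
    for candidate in candidates:
        if candidate not in tried:
            return candidate
    return candidates[0]  # everything tried: fall back to the first option


def world_model_agent(graph, action, parent_ids):
    """Stub generator (an LLM call in a real system), conditioned on the action."""
    values = [graph.nodes[i].heuristic for i in parent_ids]
    if action == "crossover":
        return sum(values) / len(values)  # blend the two parents
    return values[0] + 0.25               # "refine": fixed-size step


def critic_agent(graph, parent_ids, score):
    """Stub critic: summarize whether the step beat its best parent."""
    best_parent = max(graph.nodes[i].score for i in parent_ids)
    return "improved on parents" if score > best_parent else "regressed from parents"


def search(initial=0.0, steps=6):
    graph = EntailmentGraph()
    graph.add(initial, evaluate(initial))
    for _ in range(steps):
        action, parent_ids = policy_agent(graph)
        heuristic = world_model_agent(graph, action, parent_ids)
        score = evaluate(heuristic)
        reflection = critic_agent(graph, parent_ids, score)
        graph.add(heuristic, score, parent_ids, action, reflection)
    return graph


if __name__ == "__main__":
    for node in search().nodes:
        print(node.node_id, node.action, node.parents, node.heuristic, node.reflection)
```

In this sketch the stub policy reads only the derivation records (action and parents). The critic's reflections are stored on each node but not yet consumed, whereas the abstract describes reflections being routed back to the agents.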

Oguzhan Gungordu, Siheng Xiong, Faramarz Fekri • 2026

Related benchmarks

| Task                               | Dataset                | Metric             | Result | Rank |
|------------------------------------|------------------------|--------------------|--------|------|
| Online Bin Packing                 | Weibull distribution   | Gap (%)            | 0.29   | 63   |
| Traveling Salesman Problem         | TSP50                  | Optimality Gap     | 0.47   | 58   |
| Online Bin Packing Problem         | BPP online N=1k, W=100 | Optimality Gap     | 2.46   | 23   |
| Online Bin Packing Problem         | BPP online N=5k, W=100 | Optimality Gap     | 0.92   | 23   |
| Multi-dimensional Knapsack Problem | MKP N=100, m=5         | Objective Value    | 22.05  | 21   |
| Traveling Salesman Problem         | TSP N=100              | Cost (%)           | 1.85   | 20   |
| Traveling Salesman Problem         | TSP N=50 (test)        | Optimality Gap     | 8.41   | 19   |
| Online Bin Packing Problem         | Weibull BPP 5k 500     | Optimality Gap (%) | 0.26   | 17   |
| Online Bin Packing Problem         | Weibull BPP 10k 100    | Gap to Lower Bound | 0.84   | 17   |
| Online Bin Packing Problem         | Weibull BPP 10k 500    | Gap to LB (%)      | 0.23   | 17   |
Showing 10 of 27 rows
