Memory-Guided Tree Search with Cross-Branch Knowledge Transfer for LLM Solver Synthesis

About

Combinatorial optimization (CO) underlies decision-making from logistics to chip design, where infeasible solutions are operationally unusable and small quality gains translate into substantial economic value. Recent work uses large language models (LLMs) to automate solver synthesis: generating executable solver programs from natural-language specifications. However, existing tree-search and evolutionary agents refine candidate trajectories in parallel without explicit knowledge transfer, reintroducing the same constraint violations and converging on similar algorithm families. We introduce MEMOIR, a memory-guided tree-search framework with a two-level memory hierarchy: branch-local memory preserves execution-grounded refinement details within a branch as it iterates on a single algorithmic design, while global memory stores compressed algorithmic and failure-mode summaries across branches. A reflection step at branch termination distills these summaries, enabling cross-branch transfer without polluting future contexts with low-level debugging traces. Across seven CO problems spanning scheduling, routing, packing, and geometric design, MEMOIR achieves 96.7% solution validity (a 9.2 point gap over the strongest baseline) and improves the average normalized score by 7.3 points at matched per-method execution budget. Over three independent runs on four problems, MEMOIR's run-to-run validity standard deviation is more than an order of magnitude below that of every baseline we evaluated in this setting, suggesting that memory-guided exploration yields consistent improvements rather than reflecting sampling variance.

Fatemeh Haji, Javier Delarosa Quiros, Peyman Najafirad• 2026

Related benchmarks

Task	Dataset	Result
Combinatorial Optimization	Resource Constrained Shortest Path (test)	Average Score84.47	17
Combinatorial Optimization	Overall (test)	Average Performance73.01	17
Combinatorial Optimization	Aircraft Landing (test)	Average Score87.51	17
Combinatorial Optimization	Euclidean Steiner (test)	Average Performance81.83	15
Combinatorial Optimization	Periodic Vehicle Routing (test)	Average Value0.869	14
Combinatorial Optimization	Container Loading with Weight Restrictions (test)	Average Objective Value0.0058	12
Combinatorial Optimization	Crew Scheduling (test)	Average Performance62.4	12
Combinatorial Optimization	Container Loading (test)	Avg Performance83.64	11

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord