Hard Constraints Meet Soft Generation: Guaranteed Feasibility for LLM-based Combinatorial Optimization

About

Large language models (LLMs) have emerged as promising general-purpose solvers for combinatorial optimization (CO), yet they fundamentally lack mechanisms to guarantee solution feasibility which is critical for real-world deployment. In this work, we introduce FALCON, a framework that ensures 100\% feasibility through three key innovations: (i) \emph{grammar-constrained decoding} enforces syntactic validity, (ii) a \emph{feasibility repair layer} corrects semantic constraint violations, and (iii) \emph{adaptive Best-of-$N$ sampling} allocates inference compute efficiently. To train the underlying LLM, we introduce the Best-anchored Objective-guided Preference Optimization (BOPO) in LLM training, which weights preference pairs by their objective gap, providing dense supervision without human labels. Theoretically, we prove convergence for BOPO and provide bounds on repair-induced quality loss. Empirically, across seven NP-hard CO problems, FALCON achieves perfect feasibility while matching or exceeding the solution quality of state-of-the-art neural and LLM-based solvers.

Yang Liu, Chuan Zhou, Yancheng Chen, Shuai Zhang, Xixun Lin, Xiaoqing Wang• 2026

Related benchmarks

Task	Dataset	Result
Permutation Flow Shop Scheduling Problem	PFSP	Optimality Gap0.0086	24
Maximum Independent Set	MIS	Feasibility Rate100	21
Capacitated Vehicle Routing Problem	CVRP	Feasibility100	21
Minimum Vertex Cover	MVC	Feasibility100	21
Orienteering Problem	OP	Feasibility100	21
Traveling Salesperson Problem	TSP	Feasibility100	21
Job-Shop Scheduling Problem	JSSP	Feasibility100	21
Capacitated Vehicle Routing Problem	CVRP Small 10–30 nodes	Optimality Gap1.42	6
Capacitated Vehicle Routing Problem	CVRP Medium 40–60 nodes	Optimality Gap3.71	6
Capacitated Vehicle Routing Problem	CVRP Large 70–100 nodes	Optimality Gap6.05	6

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord