Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hard Constraints Meet Soft Generation: Guaranteed Feasibility for LLM-based Combinatorial Optimization

About

Large language models (LLMs) have emerged as promising general-purpose solvers for combinatorial optimization (CO), yet they fundamentally lack mechanisms to guarantee solution feasibility which is critical for real-world deployment. In this work, we introduce FALCON, a framework that ensures 100\% feasibility through three key innovations: (i) \emph{grammar-constrained decoding} enforces syntactic validity, (ii) a \emph{feasibility repair layer} corrects semantic constraint violations, and (iii) \emph{adaptive Best-of-$N$ sampling} allocates inference compute efficiently. To train the underlying LLM, we introduce the Best-anchored Objective-guided Preference Optimization (BOPO) in LLM training, which weights preference pairs by their objective gap, providing dense supervision without human labels. Theoretically, we prove convergence for BOPO and provide bounds on repair-induced quality loss. Empirically, across seven NP-hard CO problems, FALCON achieves perfect feasibility while matching or exceeding the solution quality of state-of-the-art neural and LLM-based solvers.

Yang Liu, Chuan Zhou, Yancheng Chen, Shuai Zhang, Xixun Lin, Xiaoqing Wang• 2026

Related benchmarks

TaskDatasetResultRank
Maximum Independent SetMIS
Feasibility Rate100
21
Capacitated Vehicle Routing ProblemCVRP
Feasibility100
21
Minimum Vertex CoverMVC
Feasibility100
21
Orienteering ProblemOP
Feasibility100
21
Traveling Salesperson ProblemTSP
Feasibility100
21
Job-Shop Scheduling ProblemJSSP
Feasibility100
21
Permutation Flow Shop Scheduling ProblemPFSP
Feasibility1
21
Capacitated Vehicle Routing ProblemCVRP Small 10–30 nodes
Optimality Gap1.42
6
Capacitated Vehicle Routing ProblemCVRP Medium 40–60 nodes
Optimality Gap3.71
6
Capacitated Vehicle Routing ProblemCVRP Large 70–100 nodes
Optimality Gap6.05
6
Showing 10 of 19 rows

Other info

Follow for update