Hard Constraints Meet Soft Generation: Guaranteed Feasibility for LLM-based Combinatorial Optimization
About
Large language models (LLMs) have emerged as promising general-purpose solvers for combinatorial optimization (CO), yet they fundamentally lack mechanisms to guarantee solution feasibility which is critical for real-world deployment. In this work, we introduce FALCON, a framework that ensures 100\% feasibility through three key innovations: (i) \emph{grammar-constrained decoding} enforces syntactic validity, (ii) a \emph{feasibility repair layer} corrects semantic constraint violations, and (iii) \emph{adaptive Best-of-$N$ sampling} allocates inference compute efficiently. To train the underlying LLM, we introduce the Best-anchored Objective-guided Preference Optimization (BOPO) in LLM training, which weights preference pairs by their objective gap, providing dense supervision without human labels. Theoretically, we prove convergence for BOPO and provide bounds on repair-induced quality loss. Empirically, across seven NP-hard CO problems, FALCON achieves perfect feasibility while matching or exceeding the solution quality of state-of-the-art neural and LLM-based solvers.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Maximum Independent Set | MIS | Feasibility Rate100 | 21 | |
| Capacitated Vehicle Routing Problem | CVRP | Feasibility100 | 21 | |
| Minimum Vertex Cover | MVC | Feasibility100 | 21 | |
| Orienteering Problem | OP | Feasibility100 | 21 | |
| Traveling Salesperson Problem | TSP | Feasibility100 | 21 | |
| Job-Shop Scheduling Problem | JSSP | Feasibility100 | 21 | |
| Permutation Flow Shop Scheduling Problem | PFSP | Feasibility1 | 21 | |
| Capacitated Vehicle Routing Problem | CVRP Small 10–30 nodes | Optimality Gap1.42 | 6 | |
| Capacitated Vehicle Routing Problem | CVRP Medium 40–60 nodes | Optimality Gap3.71 | 6 | |
| Capacitated Vehicle Routing Problem | CVRP Large 70–100 nodes | Optimality Gap6.05 | 6 |