A Unified Knowledge Embedded Reinforcement Learning-based Framework for Generalized Capacitated Vehicle Routing Problems
About
The Capacitated Vehicle Routing Problem (CVRP) is a fundamental NP-hard problem with broad applications in logistics and transportation. Real-world CVRPs often involve diverse objectives and complex constraints, such as time windows or backhaul requirements, motivating the development of a unified solution framework. Recent reinforcement learning (RL) approaches have shown promise in combinatorial optimization, yet they rely on end-to-end learning and lack explicit problem-solving knowledge, limiting solution quality. In this paper, we propose a knowledge-embedded framework inspired by the Route-First Cluster-Second heuristics. It incorporates knowledge at two levels: (1) decomposing CVRPs into the route-first and cluster-second subproblems, and (2) leveraging dynamic programming to solve the second subproblem, whose results guide the RL-based constructive solver to solve the first problem. To mitigate partial observability caused by problem decomposition, we introduce a unified history-enhanced context processing module. Extensive experiments show that this framework achieves superior solution quality compared with state-of-the-art learning-based methods, with a smaller gap to classical heuristics, demonstrating strong generalization across diverse CVRP variants.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Capacitated Vehicle Routing Problem | CVRP N=50, C=50 | Objective Value9.214 | 9 | |
| Vehicle Routing Problem | CVRP 500-node 100 random instances (test) | Cost45.274 | 7 | |
| Vehicle Routing Problem | CVRP 1000-node 100 random instances (test) | Cost72.891 | 7 | |
| Vehicle Routing Optimization | VRPB | Cost9.534 | 6 | |
| Vehicle Routing Problem Optimization | Berto n=50 1.0 | CVRP Cost10.484 | 6 | |
| Vehicle Routing Problem Optimization | Berto n=100 1.0 | CVRP Cost15.837 | 6 | |
| Capacitated Vehicle Routing Problem | CVRP n=50 Capacity 60 | Cost8.433 | 4 | |
| Capacitated Vehicle Routing Problem | CVRP n=50, Capacity 70 | Cost7.882 | 4 | |
| Capacitated Vehicle Routing Problem | CVRP n=50, Capacity 80 | Total Cost7.469 | 4 | |
| Capacitated Vehicle Routing Problem | CVRP n=50, Capacity 90 | Cost7.154 | 4 |