Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Unified Knowledge Embedded Reinforcement Learning-based Framework for Generalized Capacitated Vehicle Routing Problems

About

The Capacitated Vehicle Routing Problem (CVRP) is a fundamental NP-hard problem with broad applications in logistics and transportation. Real-world CVRPs often involve diverse objectives and complex constraints, such as time windows or backhaul requirements, motivating the development of a unified solution framework. Recent reinforcement learning (RL) approaches have shown promise in combinatorial optimization, yet they rely on end-to-end learning and lack explicit problem-solving knowledge, limiting solution quality. In this paper, we propose a knowledge-embedded framework inspired by the Route-First Cluster-Second heuristics. It incorporates knowledge at two levels: (1) decomposing CVRPs into the route-first and cluster-second subproblems, and (2) leveraging dynamic programming to solve the second subproblem, whose results guide the RL-based constructive solver to solve the first problem. To mitigate partial observability caused by problem decomposition, we introduce a unified history-enhanced context processing module. Extensive experiments show that this framework achieves superior solution quality compared with state-of-the-art learning-based methods, with a smaller gap to classical heuristics, demonstrating strong generalization across diverse CVRP variants.

Wen Wang, Xiangchen Wu, Liang Wang, Hao Hu, Xianping Tao• 2026

Related benchmarks

TaskDatasetResultRank
Capacitated Vehicle Routing ProblemCVRP N=50, C=50
Objective Value9.214
9
Vehicle Routing ProblemCVRP 500-node 100 random instances (test)
Cost45.274
7
Vehicle Routing ProblemCVRP 1000-node 100 random instances (test)
Cost72.891
7
Vehicle Routing OptimizationVRPB
Cost9.534
6
Vehicle Routing Problem OptimizationBerto n=50 1.0
CVRP Cost10.484
6
Vehicle Routing Problem OptimizationBerto n=100 1.0
CVRP Cost15.837
6
Capacitated Vehicle Routing ProblemCVRP n=50 Capacity 60
Cost8.433
4
Capacitated Vehicle Routing ProblemCVRP n=50, Capacity 70
Cost7.882
4
Capacitated Vehicle Routing ProblemCVRP n=50, Capacity 80
Total Cost7.469
4
Capacitated Vehicle Routing ProblemCVRP n=50, Capacity 90
Cost7.154
4
Showing 10 of 13 rows

Other info

Follow for update