Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

About

Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often impose additional complex constraints, markedly increasing computational complexity. However, most existing Deep Reinforcement Learning (DRL)-based methods are restricted to homogeneous scenarios, leading to suboptimal performance when applied to HFVRP and its complex variants. To bridge this gap, we investigate HFVRP under complex constraints and develop a unified DRL framework capable of solving the problem across various variant settings. We introduce the Vehicle-as-Prompt (VaP) mechanism, which formulates the problem as a single-stage autoregressive decision process. Building on this, we propose VaP-CSMV, a framework featuring a cross-semantic encoder and a multi-view decoder that effectively addresses various problem variants and captures the complex mapping relationships between vehicle heterogeneity and customer node attributes. Extensive experimental results demonstrate that VaP-CSMV significantly outperforms existing state-of-the-art DRL-based neural solvers and achieves competitive solution quality compared to traditional heuristic solvers, while reducing inference time to mere seconds. Furthermore, the framework exhibits strong zero-shot generalization capabilities on large-scale and previously unseen problem variants, while ablation studies validate the vital contribution of each component.

Shihong Huang, Shengjie Wang, Lei Gao, Hong Ma, Zhanluo Zhang, Feng Zhang, Weihua Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Heterogeneous Fleet Open Vehicle Routing ProblemHFOVRP N=50, K=20 (test)
Objective Value2.6
20
Heterogeneous Fleet Capacitated Vehicle Routing ProblemHFCVRP (N=50, K=20) (test)
Objective Value3.54
20
Heterogeneous Fleet Vehicle Routing ProblemHFCVRP (N=80, K=30)
Objective Value5.04
20
Vehicle Routing ProblemHFVRPL N=100 K=30
Objective Value6.09
10
Heterogeneous Fleet Capacitated Vehicle Routing ProblemHFCVRP zero-shot N=120, K=40
Objective Value7.05
10
Heterogeneous Fleet Open Vehicle Routing ProblemHFOVRP zero-shot (N=120, K=40)
Objective Value4.95
10
Heterogeneous Fleet Vehicle Routing Problem with BackhaulsHFVRPB zero-shot N=120, K=40
Objective Value6.77
10
Heterogeneous Fleet Vehicle Routing Problem with LinehaulsHFVRPL zero-shot (N=120, K=40)
Objective Value7.21
10
Vehicle Routing ProblemHFCVRP N=100, K=30
Objective Value5.95
10
Vehicle Routing ProblemHFOVRP N=100, K=30
Objective Value4.29
10
Showing 10 of 27 rows

Other info

Follow for update