Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem
About
Unlike homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves vehicles that differ in fixed cost, variable travel cost, and capacity, so solution quality is highly sensitive to vehicle selection. Real-world logistics applications often impose additional constraints that markedly increase computational complexity. Most existing Deep Reinforcement Learning (DRL)-based methods, however, are restricted to homogeneous fleets and perform suboptimally on HFVRP and its variants. To bridge this gap, we investigate HFVRP under complex constraints and develop a unified DRL framework that solves the problem across multiple variant settings. We introduce the Vehicle-as-Prompt (VaP) mechanism, which formulates the problem as a single-stage autoregressive decision process. Building on this mechanism, we propose VaP-CSMV, a framework with a cross-semantic encoder and a multi-view decoder that handles the different problem variants and captures the mapping between vehicle heterogeneity and customer node attributes. Extensive experiments show that VaP-CSMV significantly outperforms existing state-of-the-art DRL-based neural solvers and achieves solution quality competitive with traditional heuristic solvers, while reducing inference time to seconds. The framework also exhibits strong zero-shot generalization to large-scale instances and previously unseen problem variants, and ablation studies validate the contribution of each component.
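To make the cost structure described above concrete, the following minimal sketch evaluates one route under a heterogeneous fleet: a per-vehicle fixed cost plus a per-vehicle variable cost proportional to distance travelled, subject to a capacity check. It is an illustration under stated assumptions, not the paper's formulation: the names `Vehicle`, `route_cost`, `coords`, and `demands` are hypothetical, distances are assumed Euclidean, and each route is assumed to start and end at the depot (which would not hold for the open-route variant).

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Vehicle:
    """Hypothetical vehicle type with the three heterogeneous attributes."""
    capacity: float
    fixed_cost: float          # charged once if the vehicle is used
    cost_per_distance: float   # variable travel cost per unit distance


def route_cost(vehicle, route, coords, demands, depot=0):
    """Cost of serving `route` (a list of customer ids) with `vehicle`.

    Assumes a closed route depot -> customers -> depot and Euclidean distances.
    Raises ValueError if the total demand exceeds the vehicle capacity.
    """
    if not route:
        return 0.0  # an unused vehicle incurs no cost
    load = sum(demands[c] for c in route)
    if load > vehicle.capacity:
        raise ValueError(f"load {load} exceeds capacity {vehicle.capacity}")
    path = [depot, *route, depot]
    distance = sum(
        hypot(coords[a][0] - coords[b][0], coords[a][1] - coords[b][1])
        for a, b in zip(path, path[1:])
    )
    return vehicle.fixed_cost + vehicle.cost_per_distance * distance


# Toy instance: node 0 is the depot.
coords = {0: (0.0, 0.0), 1: (3.0, 4.0), 2: (3.0, 0.0)}
demands = {0: 0.0, 1: 2.0, 2: 1.0}
small = Vehicle(capacity=3.0, fixed_cost=10.0, cost_per_distance=1.0)
large = Vehicle(capacity=5.0, fixed_cost=20.0, cost_per_distance=2.0)

cost_small = route_cost(small, [1, 2], coords, demands)
cost_large = route_cost(large, [1, 2], coords, demands)
```

In this toy instance the same route costs twice as much on the larger vehicle, which illustrates why vehicle selection affects the objective even when the visiting order is fixed.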
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Heterogeneous Fleet Open Vehicle Routing Problem | HFOVRP N=50, K=20 (test) | Objective Value | 2.6 | 20 |
| Heterogeneous Fleet Capacitated Vehicle Routing Problem | HFCVRP (N=50, K=20) (test) | Objective Value | 3.54 | 20 |
| Heterogeneous Fleet Vehicle Routing Problem | HFCVRP (N=80, K=30) | Objective Value | 5.04 | 20 |
| Vehicle Routing Problem | HFVRPL N=100 K=30 | Objective Value | 6.09 | 10 |
| Heterogeneous Fleet Capacitated Vehicle Routing Problem | HFCVRP zero-shot N=120, K=40 | Objective Value | 7.05 | 10 |
| Heterogeneous Fleet Open Vehicle Routing Problem | HFOVRP zero-shot (N=120, K=40) | Objective Value | 4.95 | 10 |
| Heterogeneous Fleet Vehicle Routing Problem with Backhauls | HFVRPB zero-shot N=120, K=40 | Objective Value | 6.77 | 10 |
| Heterogeneous Fleet Vehicle Routing Problem with Linehauls | HFVRPL zero-shot (N=120, K=40) | Objective Value | 7.21 | 10 |
| Vehicle Routing Problem | HFCVRP N=100, K=30 | Objective Value | 5.95 | 10 |
| Vehicle Routing Problem | HFOVRP N=100, K=30 | Objective Value | 4.29 | 10 |