Rethinking Light Decoder-based Solvers for Vehicle Routing Problems
About
Light decoder-based solvers have gained popularity for solving vehicle routing problems (VRPs) due to their efficiency and ease of integration with reinforcement learning algorithms. However, they often struggle with generalization to larger problem instances or different VRP variants. This paper revisits light decoder-based approaches, analyzing the implications of their reliance on static embeddings and the inherent challenges that arise. Specifically, we demonstrate that in the light decoder paradigm, the encoder is implicitly tasked with capturing information for all potential decision scenarios during solution construction within a single set of embeddings, resulting in high information density. Furthermore, our empirical analysis reveals that the overly simplistic decoder struggles to effectively utilize this dense information, particularly as task complexity increases, which limits generalization to out-of-distribution (OOD) settings. Building on these insights, we show that enhancing the decoder capacity, with a simple addition of identity mapping and a feed-forward layer, can considerably alleviate the generalization issue. Experimentally, our method significantly enhances the OOD generalization of light decoder-based approaches on large-scale instances and complex VRP variants, narrowing the gap with the heavy decoder paradigm. Our code is available at: https://github.com/ziweileonhuang/reld-nco.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value15.713 | 87 | |
| Vehicle Routing Problem Optimization | VRPMB (100-node instances) | Objective Value13.99 | 45 | |
| Traveling Salesman Problem | TSP N=20 | Optimality Gap0.00e+0 | 45 | |
| Capacitated Vehicle Routing Problem | CVRP 20 | Objective Value6.15 | 43 | |
| Capacitated Vehicle Routing Problem | CVRP 100 | Optimality Gap (%)1.42 | 36 | |
| Asymmetric Traveling Salesperson Problem | ATSP N=100 (test) | Optimality Gap1.64 | 34 | |
| Vehicle Routing Problem | VRP 100 Customers (100 instances) | Objective Value15.75 | 28 | |
| Vehicle Routing Problem | OCVRP 48 standard 100-node benchmark instances | Objective Value10.47 | 27 | |
| Asymmetric Capacitated Vehicle Routing Problem | Real-world ACVRP In-distribution | Cost88.154 | 22 | |
| Asymmetric Capacitated Vehicle Routing Problem | Real-world ACVRP Out-of-distribution city | Cost87.764 | 22 |