Towards Generalizable Neural Solvers for Vehicle Routing Problems via Ensemble with Transferrable Local Policy
About
Machine learning has been adapted to help solve NP-hard combinatorial optimization problems. One prevalent way is learning to construct solutions by deep neural networks, which has been receiving more and more attention due to the high efficiency and less requirement for expert knowledge. However, many neural construction methods for Vehicle Routing Problems~(VRPs) focus on synthetic problem instances with specified node distributions and limited scales, leading to poor performance on real-world problems which usually involve complex and unknown node distributions together with large scales. To make neural VRP solvers more practical, we design an auxiliary policy that learns from the local transferable topological features, named local policy, and integrate it with a typical construction policy (which learns from the global information of VRP instances) to form an ensemble policy. With joint training, the aggregated policies perform cooperatively and complementarily to boost generalization. The experimental results on two well-known benchmarks, TSPLIB and CVRPLIB, of travelling salesman problem and capacitated VRP show that the ensemble policy significantly improves both cross-distribution and cross-scale generalization performance, and even performs well on real-world problems with several thousand nodes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value15.7997 | 87 | |
| Capacitated Vehicle Routing Problem | CVRP 20 | Objective Value6.3189 | 43 | |
| Traveling Salesman Problem | Uniform-TSP100 | Optimality Gap0.225 | 41 | |
| Capacitated Vehicle Routing Problem | CVRP 100 | Optimality Gap (%)2.13 | 36 | |
| Capacitated Vehicle Routing Problem | CVRP-200 | Objective Value20.6787 | 35 | |
| Asymmetric Traveling Salesperson Problem | ATSP N=100 (test) | Optimality Gap2.17 | 34 | |
| Capacitated Vehicle Routing Problem | CVRP 1000 | Objective Value15.8382 | 29 | |
| Vehicle Routing Problem | VRP 100 Customers (100 instances) | Objective Value15.8 | 28 | |
| Traveling Salesman Problem | Euclidean TSP n=100 Uniform distribution in unit square (test) | Tour Length7.781 | 27 | |
| Traveling Salesman Problem | Euclidean TSP n=500 Uniform distribution in unit square (test) | Tour Length17.714 | 27 |