Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt

About

In this paper, we present Neural k-Opt (NeuOpt), a novel learning-to-search (L2S) solver for routing problems. It learns to perform flexible k-opt exchanges based on a tailored action factorization method and a customized recurrent dual-stream decoder. As a pioneering work to circumvent the pure feasibility masking scheme and enable the autonomous exploration of both feasible and infeasible regions, we then propose the Guided Infeasible Region Exploration (GIRE) scheme, which supplements the NeuOpt policy network with feasibility-related features and leverages reward shaping to steer reinforcement learning more effectively. Additionally, we equip NeuOpt with Dynamic Data Augmentation (D2A) for more diverse searches during inference. Extensive experiments on the Traveling Salesman Problem (TSP) and Capacitated Vehicle Routing Problem (CVRP) demonstrate that our NeuOpt not only significantly outstrips existing (masking-based) L2S solvers, but also showcases superiority over the learning-to-construct (L2C) and learning-to-predict (L2P) solvers. Notably, we offer fresh perspectives on how neural solvers can handle VRP constraints. Our code is available: https://github.com/yining043/NeuOpt.

Yining Ma, Zhiguang Cao, Yeow Meng Chee• 2023

Related benchmarks

TaskDatasetResultRank
Traveling Salesman ProblemTSP50
Optimality Gap0.00e+0
58
Capacitated Vehicle Routing ProblemCVRP N=100
Objective Value15.579
50
Traveling Salesman ProblemTSP-200
Optimality Gap0.01
28
Traveling Salesman ProblemTSP N=200
Cost Gap0.01
24
Traveling Salesman Problem with Time WindowTSPTW Hard n=100
Objective Value46.913
22
Traveling Salesman ProblemTSP N=100
Cost (%)0.00e+0
20
Traveling Salesman Problem with Time WindowsTSPTW hard variant (n=50)
Infeasibility Rate0.02
20
Capacitated Vehicle Routing ProblemCVRP-200
Objective Value21.83
20
Capacitated Vehicle Routing Problem with Backhauls and Time WindowsCVRPBLTW n=50 v1
Objective Value14.201
18
Capacitated Vehicle Routing Problem with Backhauls and Time WindowsCVRPBLTW n=100 v1
Objective Value24.038
18
Showing 10 of 17 rows

Other info

Code

Follow for update