Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Attention, Learn to Solve Routing Problems!

About

The recently presented idea to learn heuristics for combinatorial optimization problems is promising as it can save costly development. However, to push this idea towards practical implementation, we need better models and better ways of training. We contribute in both directions: we propose a model based on attention layers with benefits over the Pointer Network and we show how to train this model using REINFORCE with a simple baseline based on a deterministic greedy rollout, which we find is more efficient than using a value function. We significantly improve over recent learned heuristics for the Travelling Salesman Problem (TSP), getting close to optimal results for problems up to 100 nodes. With the same hyperparameters, we learn strong heuristics for two variants of the Vehicle Routing Problem (VRP), the Orienteering Problem (OP) and (a stochastic variant of) the Prize Collecting TSP (PCTSP), outperforming a wide range of baselines and getting results close to highly optimized and specialized algorithms.

Wouter Kool, Herke van Hoof, Max Welling• 2018

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH500 (test)--
895
Multi-robot task planningF1-F16 fixed-scale simulated instances
Makespan (s)138.7
112
Traveling Salesman ProblemTSP-500 (test)
Gap18.03
95
Capacitated Vehicle Routing ProblemCVRP N=100
Objective Value16.139
87
Traveling Salesman ProblemTSP-100
Optimality Drop4.53
69
Traveling Salesman ProblemTSP-1000 (test)
Optimality Gap29.23
57
Traveling Salesman Problem (TSP)TSP n=100 10K instances (test)
Objective Value7.94
52
Traveling Salesman ProblemTSP 1K (test)
Length33.55
45
Capacitated Vehicle Routing ProblemCVRP N=100 10,000 instances (test)
Objective Value16.23
44
Capacitated Vehicle Routing ProblemCVRP 20
Objective Value6.24
43
Showing 10 of 110 rows
...

Other info

Follow for update