Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem

About

Traveling Salesman Problem (TSP), as a classic routing optimization problem originally arising in the domain of transportation and logistics, has become a critical task in broader domains, such as manufacturing and biology. Recently, Deep Reinforcement Learning (DRL) has been increasingly employed to solve TSP due to its high inference efficiency. Nevertheless, most of existing end-to-end DRL algorithms only perform well on small TSP instances and can hardly generalize to large scale because of the drastically soaring memory consumption and computation time along with the enlarging problem scale. In this paper, we propose a novel end-to-end DRL approach, referred to as Pointerformer, based on multi-pointer Transformer. Particularly, Pointerformer adopts both reversible residual network in the encoder and multi-pointer network in the decoder to effectively contain memory consumption of the encoder-decoder architecture. To further improve the performance of TSP solutions, Pointerformer employs both a feature augmentation method to explore the symmetries of TSP at both training and inference stages as well as an enhanced context embedding approach to include more comprehensive context information in the query. Extensive experiments on a randomly generated benchmark and a public benchmark have shown that, while achieving comparative results on most small-scale TSP instances as SOTA DRL approaches do, Pointerformer can also well generalize to large-scale TSPs.

Yan Jin, Yuandong Ding, Xuanhao Pan, Kun He, Li Zhao, Tao Qin, Lei Song, Jiang Bian• 2023

Related benchmarks

Task	Dataset	Result
Traveling Salesman Problem	TSP50	Optimality Gap0.02	77
Traveling Salesman Problem	TSP-100	Optimality Drop0.15	69
Traveling Salesman Problem	Uniform-TSP100	Optimality Gap0.163	41
Traveling Salesman Problem	TSP500 Uniform distribution, scale ≤ 1,000	Objective Value17.0854	17
Traveling Salesperson Problem	TSP1000	Optimality Gap20.7	17
Traveling Salesman Problem	TSP1000 Uniform distribution, scale ≤ 1,000	Objective Value24.799	17
Traveling Salesman Problem	TSP500	Optimality Gap12.5	16
Traveling Salesman Problem	TSP200 Uniform distribution, scale ≤ 1,000	Objective Value10.7796	13
Traveling Salesman Problem	TSP200	Solution Gap (%)1.45	10

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord