Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning

About

Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism specifically prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede the pairing delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for solving PDP. Extensive experimental results show that our method outperforms the state-of-the-art heuristic and deep learning model, respectively, and generalizes well to different distributions and problem sizes.

Jingwen Li, Liang Xin, Zhiguang Cao, Andrew Lim, Wen Song, Jie Zhang• 2021

Related benchmarks

Task	Dataset	Result
Pickup and Delivery Problem	PDP20 uniform	Objective Value4.595	9
Pickup and Delivery Problem (PDP)	PDP40 cluster (test)	Objective Value3.563	9
Pickup and Delivery Problem	PDP10 uniform	Objective Value3.297	9
Pickup and Delivery Problem	PDP80 uniform	Objective Value10.101	9
Pickup and Delivery Problem (PDP)	PDP80-cluster (test)	Objective Value4.737	9
Pickup and Delivery Problem	PDP40 uniform	Objective Value6.802	9
Pickup and Delivery Problem (PDP)	PDP10-cluster (test)	Objective Value2.254	9
Pickup and Delivery Problem (PDP)	PDP20-cluster (test)	Objective Value2.764	9
Pickup and Delivery Traveling Salesman Problem	PDTSP100 1,000 instances	Solution Gap6.61	4

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord