Pareto Set Learning for Neural Multi-objective Combinatorial Optimization

About

Multiobjective combinatorial optimization (MOCO) problems can be found in many real-world applications. However, exactly solving these problems would be very challenging, particularly when they are NP-hard. Many handcrafted heuristic methods have been proposed to tackle different MOCO problems over the past decades. In this work, we generalize the idea of neural combinatorial optimization, and develop a learning-based approach to approximate the whole Pareto set for a given MOCO problem without further search procedure. We propose a single preference-conditioned model to directly generate approximate Pareto solutions for any trade-off preference, and design an efficient multiobjective reinforcement learning algorithm to train this model. Our proposed method can be treated as a learning-based extension for the widely-used decomposition-based multiobjective evolutionary algorithm (MOEA/D). It uses a single model to accommodate all the possible preferences, whereas other methods use a finite number of solutions to approximate the Pareto set. Experimental results show that our proposed method significantly outperforms some other methods on the multiobjective traveling salesman problem, multiobjective vehicle routing problem, and multiobjective knapsack problem in terms of solution quality, speed, and model efficiency.

Xi Lin, Zhiyuan Yang, Qingfu Zhang• 2022

Related benchmarks

Task	Dataset	Result
Multi-Objective Traveling Salesperson Problem	KroAB200	Hypervolume (HV)72.51	44
Tri-Objective Traveling Salesman Problem	Tri-TSP50	Hypervolume (HV)0.4409	44
Bi-objective Traveling Salesman Problem	Bi-TSP50	Hypervolume (HV)0.6395	44
Multi-Objective Traveling Salesperson Problem	KroAB100	Hypervolume (HV)0.6937	44
Multi-Objective Traveling Salesperson Problem	KroAB150	Hypervolume (HV)68.86	44
Multi-objective Knapsack Problem	Bi-KP n=50	HV0.3552	34
Multi-objective Knapsack Problem	Bi-KP n=100	HV0.4523	34
Multi-objective Knapsack Problem	Bi-KP n=200	HV0.3595	34
Tri-Objective Traveling Salesman Problem	Tri-TSP 20 nodes (test)	Hypervolume (HV)0.4712	24
Bi-objective Traveling Salesman Problem	Bi-TSP 100	Hypervolume (HV)70.37	24

Showing 10 of 40 rows

Other info

Follow for update

@wizwand_team Discord