Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pareto Set Learning for Neural Multi-objective Combinatorial Optimization

About

Multiobjective combinatorial optimization (MOCO) problems can be found in many real-world applications. However, exactly solving these problems would be very challenging, particularly when they are NP-hard. Many handcrafted heuristic methods have been proposed to tackle different MOCO problems over the past decades. In this work, we generalize the idea of neural combinatorial optimization, and develop a learning-based approach to approximate the whole Pareto set for a given MOCO problem without further search procedure. We propose a single preference-conditioned model to directly generate approximate Pareto solutions for any trade-off preference, and design an efficient multiobjective reinforcement learning algorithm to train this model. Our proposed method can be treated as a learning-based extension for the widely-used decomposition-based multiobjective evolutionary algorithm (MOEA/D). It uses a single model to accommodate all the possible preferences, whereas other methods use a finite number of solutions to approximate the Pareto set. Experimental results show that our proposed method significantly outperforms some other methods on the multiobjective traveling salesman problem, multiobjective vehicle routing problem, and multiobjective knapsack problem in terms of solution quality, speed, and model efficiency.

Xi Lin, Zhiyuan Yang, Qingfu Zhang• 2022

Related benchmarks

TaskDatasetResultRank
Multi-Objective Traveling Salesman ProblemBi-TSP-1 n=50
Hypervolume (HV)0.6395
15
Multi-Objective Traveling Salesman ProblemBi-TSP-1 n=20
Hypervolume0.627
15
Multi-Objective Traveling Salesman ProblemTri-TSP-1 n=20
Hypervolume (HV)0.4712
9
Multi-Objective Traveling Salesman ProblemTri-TSP-1 n=50
HV0.4409
9
Bi-objective Traveling Salesman ProblemBi-TSP-1 n=150 (200 random instances)
HV0.6967
9
Bi-objective Traveling Salesman ProblemBi-TSP-1 n=200 (random instances)
Hypervolume (HV)0.7283
9
Multi-Objective Traveling Salesman ProblemKroAB100 generalization TSPLIB (test)
Hypervolume (HV)0.6937
9
Multi-Objective Traveling Salesman ProblemKroAB150 generalization TSPLIB (test)
HV0.6886
9
Multi-Objective Traveling Salesman ProblemKroAB200 generalization TSPLIB (test)
Hypervolume (HV)0.7251
9
Multi-Objective Traveling Salesman ProblemBi-TSP-1 n=100
HV0.7016
9
Showing 10 of 17 rows

Other info

Follow for update