Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems

About

Deep reinforcement learning (DRL) has recently shown its success in tackling complex combinatorial optimization problems. When these problems are extended to multiobjective ones, it becomes difficult for the existing DRL approaches to flexibly and efficiently deal with multiple subproblems determined by weight decomposition of objectives. This paper proposes a concise meta-learning-based DRL approach. It first trains a meta-model by meta-learning. The meta-model is fine-tuned with a few update steps to derive submodels for the corresponding subproblems. The Pareto front is then built accordingly. Compared with other learning-based methods, our method can greatly shorten the training time of multiple submodels. Due to the rapid and excellent adaptability of the meta-model, more submodels can be derived so as to increase the quality and diversity of the found solutions. The computational experiments on multiobjective traveling salesman problems and multiobjective vehicle routing problem with time windows demonstrate the superiority of our method over most of learning-based and iteration-based approaches.

Zizhen Zhang, Zhiyuan Wu, Hang Zhang, Jiahai Wang• 2021

Related benchmarks

TaskDatasetResultRank
Tri-Objective Traveling Salesman ProblemTri-TSP20
Hypervolume (HV)0.4712
20
Bi-objective Traveling Salesman ProblemBi-TSP20
HV0.6271
20
Bi-objective Traveling Salesman ProblemBi-TSP50
Hypervolume (HV)0.6408
20
Tri-Objective Traveling Salesman ProblemTri-TSP50
Hypervolume (HV)0.4408
20
Multi-Objective Traveling Salesperson ProblemKroAB100
Hypervolume (HV)0.695
20
Bi-objective Traveling Salesman ProblemBi-TSP200
Hypervolume (HV)72.99
20
Bi-objective Traveling Salesman ProblemBi-TSP100
Hypervolume (HV)0.7022
20
Multi-Objective Traveling Salesperson ProblemKroAB200
Hypervolume (HV)72.61
20
Tri-Objective Traveling Salesman ProblemTri-TSP100
Hypervolume (HV)49.58
20
Bi-objective Traveling Salesman ProblemBi-TSP150
Hypervolume (HV)0.6976
20
Showing 10 of 28 rows

Other info

Follow for update