Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems

About

Recently, deep reinforcement learning (DRL) models have shown promising results in solving NP-hard Combinatorial Optimization (CO) problems. However, most DRL solvers can only scale to a few hundreds of nodes for combinatorial optimization problems on graphs, such as the Traveling Salesman Problem (TSP). This paper addresses the scalability challenge in large-scale combinatorial optimization by proposing a novel approach, namely, DIMES. Unlike previous DRL methods which suffer from costly autoregressive decoding or iterative refinements of discrete solutions, DIMES introduces a compact continuous space for parameterizing the underlying distribution of candidate solutions. Such a continuous space allows stable REINFORCE-based training and fine-tuning via massively parallel sampling. We further propose a meta-learning framework to enable the effective initialization of model parameters in the fine-tuning stage. Extensive experiments show that DIMES outperforms recent DRL-based methods on large benchmark datasets for Traveling Salesman Problems and Maximal Independent Set problems.

Ruizhong Qiu, Zhiqing Sun, Yiming Yang• 2022

Related benchmarks

TaskDatasetResultRank
Traveling Salesman ProblemTSP-500 (test)
Gap1.76
85
Maximum Independent SetER [700-800]
Solution Size42.06
48
Traveling Salesman ProblemTSP 1K (test)
Length23.69
45
Traveling Salesperson ProblemTSP-100
Solution Length7.7617
42
Traveling Salesman ProblemTSP-1000 (test)
Optimality Gap2.46
36
Traveling Salesman ProblemTSP-500
Solution Length16.87
35
Maximum Independent SetSATLIB
MIS Size423.3
35
Traveling Salesperson ProblemTSP-1k
Solution Length23.73
31
Traveling Salesperson ProblemTSP N=1000 Generalization (128 instances)
Optimality Gap17.69
30
Traveling Salesperson ProblemTSP N=500 Generalization (128 instances)
Optimality Gap16.07
30
Showing 10 of 24 rows

Other info

Code

Follow for update