Unsupervised Learning for Combinatorial Optimization Needs Meta-Learning
About
A general framework of unsupervised learning for combinatorial optimization (CO) is to train a neural network (NN) whose output gives a problem solution by directly optimizing the CO objective. Albeit with some advantages over traditional solvers, the current framework optimizes an averaged performance over the distribution of historical problem instances, which misaligns with the actual goal of CO that looks for a good solution to every future encountered instance. With this observation, we propose a new objective of unsupervised learning for CO where the goal of learning is to search for good initialization for future problem instances rather than give direct solutions. We propose a meta-learning-based training pipeline for this new objective. Our method achieves good empirical performance. We observe that even just the initial solution given by our model before fine-tuning can significantly outperform the baselines under various evaluation settings including evaluation across multiple datasets, and the case with big shifts in the problem scale. The reason we conjecture is that meta-learning-based training lets the model be loosely tied to each local optima for a training instance while being more adaptive to the changes of optimization landscapes across instances.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Maximum Clique | Twitter MC instances (static) | Mean ApR0.9875 | 38 | |
| Maximum Clique | RB200 MC instances (static) | Mean ApR95.546 | 38 | |
| Maximum Clique | RB500 MC instances (static) | Mean ApR95.514 | 36 | |
| Maximum Clique | COLLAB | Mean ApR0.9997 | 30 | |
| Minimum Vertex Cover | RB200 (test) | Approximation Ratio1.028 | 24 | |
| Minimum Vertex Cover | COLLAB (test) | AR*1.002 | 16 | |
| Minimum Vertex Cover | RB500 (test) | Approximation Ratio1.016 | 13 | |
| Minimum Vertex Cover | IMDB-BINARY (test) | AR*1 | 12 | |
| Minimum Vertex Cover | Twitter (test) | AR*1.014 | 12 | |
| Maximum Cut | BA 200-300 graphs (test) | MCut Value688.3 | 11 |