DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization
About
Ant Colony Optimization (ACO) is a meta-heuristic algorithm that has been successfully applied to various Combinatorial Optimization Problems (COPs). Traditionally, customizing ACO for a specific problem requires the expert design of knowledge-driven heuristics. In this paper, we propose DeepACO, a generic framework that leverages deep reinforcement learning to automate heuristic designs. DeepACO serves to strengthen the heuristic measures of existing ACO algorithms and dispense with laborious manual design in future ACO applications. As a neural-enhanced meta-heuristic, DeepACO consistently outperforms its ACO counterparts on eight COPs using a single neural architecture and a single set of hyperparameters. As a Neural Combinatorial Optimization method, DeepACO performs better than or on par with problem-specific methods on canonical routing problems. Our code is publicly available at https://github.com/henry-yeh/DeepACO.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Traveling Salesman Problem | TSP-500 (test) | Gap1.84 | 85 | |
| Traveling Salesman Problem | TSP50 | Optimality Gap0.00e+0 | 58 | |
| Traveling Salesman Problem | TSP-100 | -- | 53 | |
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value14.93 | 50 | |
| Traveling Salesman Problem | TSP 1K (test) | Length23.85 | 30 | |
| Capacitated Vehicle Routing Problem | CVRP | Objective Value9.13 | 21 | |
| Offline Bin Packing Problem | Offline BPP | Objective Value203.1 | 21 | |
| Multidimensional Knapsack Problem | MKP | Objective Value103 | 21 | |
| Traveling Salesman Problem | TSP | Objective Value5.811 | 21 | |
| Orienteering Problem | OP | Objective Value55.458 | 21 |