GOAL: A Generalist Combinatorial Optimization Agent Learner
About
Machine Learning-based heuristics have recently shown impressive performance in solving a variety of hard combinatorial optimization problems (COPs). However, they generally rely on a separate neural model, specialized and trained for each single problem. Any variation of a problem requires adjustment of its model and re-training from scratch. In this paper, we propose GOAL (for Generalist combinatorial Optimization Agent Learner), a generalist model capable of efficiently solving multiple COPs and which can be fine-tuned to solve new COPs. GOAL consists of a single backbone plus light-weight problem-specific adapters for input and output processing. The backbone is based on a new form of mixed-attention blocks which allows to handle problems defined on graphs with arbitrary combinations of node, edge and instance-level features. Additionally, problems which involve heterogeneous types of nodes or edges are handled through a novel multi-type transformer architecture, where the attention blocks are duplicated to attend the meaningful combinations of types while relying on the same shared parameters. We train GOAL on a set of routing, scheduling and classic graph problems and show that it is only slightly inferior to the specialized baselines while being the first multi-task model that solves a wide range of COPs. Finally we showcase the strong transfer learning capacity of GOAL by fine-tuning it on several new problems. Our code is available at https://github.com/naver/goal-co/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Capacitated Vehicle Routing Problem | CVRP N=100 | Objective Value16.3 | 87 | |
| Traveling Salesman Problem | TSP N=20 | Optimality Gap0.26 | 45 | |
| Capacitated Vehicle Routing Problem | CVRP 20 | Objective Value6.2 | 43 | |
| Capacitated Vehicle Routing Problem | CVRP 100 | Optimality Gap (%)4.22 | 36 | |
| Traveling Salesman Problem | TSP N=100 | Cost (%)2.84 | 32 | |
| Asymmetric Capacitated Vehicle Routing Problem | Real-world ACVRP In-distribution | Cost84.341 | 22 | |
| Asymmetric Capacitated Vehicle Routing Problem | Real-world ACVRP Out-of-distribution city | Cost84.097 | 22 | |
| Asymmetric Capacitated Vehicle Routing Problem | Real-world ACVRP Out-of-distribution cluster | Cost34.318 | 22 | |
| Capacitated Vehicle Routing Problem with Time Windows | CVRPTW100 1,000 instances | Optimality Gap4.66 | 18 | |
| Capacitated Vehicle Routing Problem | CVRP N=50 | Objective Value10.73 | 17 |