Domain-Independent Dynamic Programming
About
For combinatorial optimization problems, model-based paradigms such as mixed-integer programming (MIP) and constraint programming (CP) aim to decouple modeling and solving a problem: the `holy grail' of declarative problem solving. We propose domain-independent dynamic programming (DIDP), a novel model-based paradigm based on dynamic programming (DP). While DP is not new, it has typically been implemented as a problem-specific method. We introduce Dynamic Programming Description Language (DyPDL), a formalism to define DP models based on a state transition system, inspired by artificial intelligence (AI) planning. we show that heuristic search algorithms can be used to solve DyPDL models and propose seven DIDP solvers. We experimentally compare our DIDP solvers with commercial MIP and CP solvers (solving MIP and CP models, respectively) on common benchmark instances of eleven combinatorial optimization problem classes. We show that DIDP outperforms MIP in nine problem classes, CP also in nine problem classes, and both MIP and CP in seven. DIDP also achieves superior performance to existing state-based solvers including domain-independent AI planners.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Capacitated Vehicle Routing Problem | CVRP 207 | Average Optimality Gap69.12 | 10 | |
| Capacitated Vehicle Routing Problem | CVRP 207 | Average primal integral333.7 | 10 | |
| Combinatorial Optimization | TSPTW 340 | Coverage259 | 10 | |
| Combinatorial Optimization | SALBP-1 2100 | Coverage1.80e+3 | 10 | |
| Combinatorial Optimization | Talent Scheduling 1000 | Solution Coverage239 | 10 | |
| Multi-vehicle Pickup and Delivery Problem with Time Windows | m-PDTSP 1178 | Average Primal Integral5.24 | 10 | |
| Simple Assembly Line Balancing Problem | SALBP-1 2100 | Average Primal Integral1.92 | 10 | |
| Simple Assembly Line Balancing Problem (Type 1) | SALBP-1 2100 | Average Optimality Gap0.57 | 10 | |
| Talent Scheduling | Talent Scheduling 1000 | Average Optimality Gap0.1697 | 10 | |
| Traveling Salesperson Problem with Time Windows | TSPTW 340 | Average Optimality Gap11.51 | 10 |