Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Solve Orienteering Problem with Time Windows and Variable Profits

About

The orienteering problem with time windows and variable profits (OPTWVP) is common in many real-world applications and involves continuous time variables. Current approaches fail to develop an efficient solver for this orienteering problem variant with discrete and continuous variables. In this paper, we propose a learning-based two-stage DEcoupled discrete-Continuous optimization with Service-time-guided Trajectory (DeCoST), which aims to effectively decouple the discrete and continuous decision variables in the OPTWVP problem, while enabling efficient and learnable coordination between them. In the first stage, a parallel decoding structure is employed to predict the path and the initial service time allocation. The second stage optimizes the service times through a linear programming (LP) formulation and provides a long-horizon learning of structure estimation. We rigorously prove the global optimality of the second-stage solution. Experiments on OPTWVP instances demonstrate that DeCoST outperforms both state-of-the-art constructive solvers and the latest meta-heuristic algorithms in terms of solution quality and computational efficiency, achieving up to 6.6x inference speedup on instances with fewer than 500 nodes. Moreover, the proposed framework is compatible with various constructive solvers and consistently enhances the solution quality for OPTWVP.

Songqun Gao, Zanxi Ruan, Patrick Floor, Marco Roveri, Luigi Palopoli, Daniele Fontanelli• 2026

Related benchmarks

TaskDatasetResultRank
Combinatorial OptimizationOPTWVP n=50, TW=500
Score51.4
13
Combinatorial OptimizationOPTWVP n=50, TW=100
Score15.1
12
Combinatorial OptimizationOPTWVP n=100, TW=100
Score25.6
12
OPTWVPOPTWVP large-scale configuration n = 500, TW = 100
Score79.6
7
Combinatorial OptimizationOPTWVP n=100, TW=500
Runtime100
7
OPTWVPOPTWVP n=100, TW=500
Runtime (s)100
6
OPTWVPSolomon 100 (test)
Score1.07e+4
3
Team Orienteering Problem with Time Windows and Variable ProfitsTOPTWVP n = 50, TW = 100
Score36.99
3
Showing 8 of 8 rows

Other info

Follow for update