Score Function Gradient Estimation to Widen the Applicability of Decision-Focused Learning

About

Many real-world optimization problems contain parameters that are unknown before deployment time, either due to stochasticity or to lack of information (e.g., demand or travel times in delivery problems). A common strategy in such cases is to estimate said parameters via machine learning (ML) models trained to minimize the prediction error, which however is not necessarily aligned with the downstream task-level error. The decision-focused learning (DFL) paradigm overcomes this limitation by training to directly minimize a task loss, e.g. regret. Since the latter has non-informative gradients for combinatorial problems, state-of-the-art DFL methods introduce surrogates and approximations that enable training. But these methods exploit specific assumptions about the problem structures (e.g., convex or linear problems, unknown parameters only in the objective function). We propose an alternative method that makes no such assumptions, it combines stochastic smoothing with score function gradient estimation which works on any task loss. This opens up the use of DFL methods to nonlinear objectives, uncertain parameters in the problem constraints, and even two-stage stochastic optimization. Experiments show that it typically requires more epochs, but that it is on par with specialized methods and performs especially well for the difficult case of problems with uncertainty in the constraints, in terms of solution quality, scalability, or both.

Mattia Silvestri, Senne Berden, Jayanta Mandi, Ali \.Irfan Mahmuto\u{g}ullar{\i}, Brandon Amos, Tias Guns, Michele Lombardi• 2023

Related benchmarks

Task	Dataset	Result
Predicting Constraints Parameters	WSMC 10x50	Relative PRegret1.93	9
Predicting Constraints Parameters for Integer Linear Programs	KP-50 (test)	Relative Primal Regret0.126	9
Predicting Objective Parameters	KP-50 (test)	Relative Regret0.008	5
Predicting Constraint Parameters	Fractional Knapsack Problem (KP) capacity=50, ρ=0 synthetic (test)	Relative Predictive Regret38.5	3
Predicting Constraint Parameters	Fractional Knapsack Problem (KP) capacity=50 ρ=1 synthetic (test)	Rel. PRegret46.7	3
Predicting Constraint Parameters	Fractional Knapsack Problem (KP) capacity=50, ρ=2 synthetic (test)	Relative Predictive Regret0.512	3
Combinatorial Optimization	WSMC-10-250	Runtime900	2
Combinatorial Optimization	WSMC-10-500	Runtime900	2
Combinatorial Optimization	WSMC-10-750	Runtime900	2
Combinatorial Optimization	WSMC-10-1000	Runtime900	2

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord