Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

About

Creating impact in real-world settings requires artificial intelligence techniques to span the full pipeline from data, to predictive models, to decisions. These components are typically approached separately: a machine learning model is first trained via a measure of predictive accuracy, and then its predictions are used as input into an optimization algorithm which produces a decision. However, the loss function used to train the model may easily be misaligned with the end goal, which is to make the best decisions possible. Hand-tuning the loss function to align with optimization is a difficult and error-prone process (which is often skipped entirely). We focus on combinatorial optimization problems and introduce a general framework for decision-focused learning, where the machine learning model is directly trained in conjunction with the optimization algorithm to produce high-quality decisions. Technically, our contribution is a means of integrating common classes of discrete optimization problems into deep learning or other predictive models, which are typically trained via gradient descent. The main idea is to use a continuous relaxation of the discrete problem to propagate gradients through the optimization procedure. We instantiate this framework for two broad classes of combinatorial problems: linear programs and submodular maximization. Experimental results across a variety of domains show that decision-focused learning often leads to improved optimization performance compared to traditional methods. We find that standard measures of accuracy are not a reliable proxy for a predictive model's utility in optimization, and our method's ability to specify the true goal as the model's training objective yields substantial dividends across a range of decision problems.

Bryan Wilder, Bistra Dilkina, Milind Tambe• 2018

Related benchmarks

Task	Dataset	Result
Minimum Cost Vertex Cover	POLSKA Real-life (test)	Mean Regret2.57	20
Minimum Cost Vertex Cover	PDH Real-life (test)	Mean Regret9.28	20
Minimum Cost Vertex Cover	POLSKA Artificial (test)	Mean Regret117.5	20
Minimum Cost Vertex Cover	PDH Artificial (test)	Mean Regret79.31	20
Minimum Cost Flow Problem (MCFP)	GÉANT Real-life Size 100	Mean Regret84.6	10
Minimum Cost Flow Problem (MCFP)	GÉANT (Real-life) Size 300	Mean Regret102.3	10
Minimum Cost Flow Problem (MCFP)	GÉANT Artificial Size 100	Mean Regret1.06e+3	10
Minimum Cost Flow Problem (MCFP)	GÉANT Artificial Size 300	Mean Regret1.09e+3	10
Minimum Cost Flow Problem (MCFP)	USANet Real-life Size 100	Mean Regret259.9	10
Minimum Cost Flow Problem (MCFP)	USANet Real-life Size 300	Mean Regret212.8	10

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord