
Learning with Differentiable Perturbed Optimizers

About

Machine learning pipelines often rely on optimization procedures to make discrete decisions (e.g., sorting, picking closest neighbors, or shortest paths). Although these discrete decisions are easily computed, they break the back-propagation of computational graphs. In order to expand the scope of learning problems that can be solved in an end-to-end fashion, we propose a systematic method to transform optimizers into operations that are differentiable and never locally constant. Our approach relies on stochastically perturbed optimizers, and can be used readily together with existing solvers. Their derivatives can be evaluated efficiently, and smoothness tuned via the chosen noise amplitude. We also show how this framework can be connected to a family of losses developed in structured prediction, and give theoretical guarantees for their use in learning tasks. We demonstrate experimentally the performance of our approach on various tasks.
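The idea described in the abstract can be made concrete with a small sketch: replace a discrete solver y*(θ) with its expectation under random perturbations, y_ε(θ) = E[y*(θ + εZ)], and estimate both the smoothed output and its Jacobian by Monte Carlo. Below is a minimal NumPy illustration for the simplest case, a one-hot argmax with Gaussian noise. The function names, sample count, and noise amplitude are illustrative assumptions, not the authors' released implementation.

```python
import numpy as np

def one_hot_argmax(theta):
    """Discrete solver: argmax over the vertices of the simplex (one-hot output)."""
    y = np.zeros_like(theta)
    y[np.argmax(theta)] = 1.0
    return y

def perturbed_argmax(theta, epsilon=0.5, n_samples=1000, seed=None):
    """Monte Carlo estimate of the perturbed maximizer and its Jacobian.

    y_eps ~= E[argmax(theta + eps * Z)]            (smoothed, differentiable output)
    jac   ~= E[argmax(theta + eps * Z) Z^T] / eps  (Jacobian estimate for Gaussian Z)

    Larger epsilon gives a smoother map; smaller epsilon stays closer to the
    original discrete argmax (the "noise amplitude" knob from the abstract).
    """
    rng = np.random.default_rng(seed)
    d = theta.shape[0]
    Z = rng.standard_normal((n_samples, d))
    Y = np.stack([one_hot_argmax(theta + epsilon * z) for z in Z])
    y_eps = Y.mean(axis=0)
    jac = (Y[:, :, None] * Z[:, None, :]).mean(axis=0) / epsilon
    return y_eps, jac
```

In a learning pipeline, the same recipe applies to any black-box solver (sorting, matching, shortest paths): the solver is called on perturbed inputs in the forward pass, and the noise samples are reused to form a gradient estimate in the backward pass.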

Quentin Berthet, Mathieu Blondel, Olivier Teboul, Marco Cuturi, Jean-Philippe Vert, Francis Bach • 2020

Related benchmarks

Task | Dataset | Metric | Result | Rank
Shortest Path Prediction | Warcraft II | Accuracy | 94.8 | 16
Set Matching | Set Matching SM1 (test) | Regret (%) | 92.08 | 10
Synthetic Shortest Path | Synthetic Shortest Path SP1 (test) | Regret (%) | 17.73 | 10
Set Matching | Set Matching SM2 (test) | Regret (%) | 92.35 | 10
Synthetic Shortest Path | Synthetic Shortest Path SP2 (test) | Regret (%) | 11.83 | 10
Set Matching | Set Matching SM3 (test) | Regret (%) | 92.35 | 10
Synthetic Shortest Path | Synthetic Shortest Path SP3 (test) | Regret Rate | 125.6 | 10
