OptNet: Differentiable Optimization as a Layer in Neural Networks

About

This paper presents OptNet, a network architecture that integrates optimization problems (here, specifically in the form of quadratic programs) as individual layers in larger end-to-end trainable deep networks. These layers encode constraints and complex dependencies between the hidden states that traditional convolutional and fully-connected layers often cannot capture. We explore the foundations for such an architecture: we show how techniques from sensitivity analysis, bilevel optimization, and implicit differentiation can be used to exactly differentiate through these layers and with respect to layer parameters; we develop a highly efficient solver for these layers that exploits fast GPU-based batch solves within a primal-dual interior point method, and which provides backpropagation gradients with virtually no additional cost on top of the solve; and we highlight the application of these approaches in several problems. In one notable example, the method is learns to play mini-Sudoku (4x4) given just input and output games, with no a-priori information about the rules of the game; this highlights the ability of OptNet to learn hard constraints better than other neural architectures.

Brandon Amos, J. Zico Kolter• 2017

Related benchmarks

Task	Dataset	Result
Projection onto chains	Projection onto chains	Backward Latency (ms)7.25	29
Projection onto the probability simplex	Probability Simplex	Backward Time (ms)0.54	26
Portfolio Optimization	Multi-period Portfolio Optimization	Backward Latency (ms)3.24	25
Sudoku Solving	Sudoku	Average per-epoch runtime (ms)12.16	9
Constrained Quadratic Programming	Constrained QP 100 variables, 50 equality constraints, 30 inequality constraints	Objective Value-16.33	7
Constrained Quadratic Programming	Constrained QP 100 variables, 50 equality constraints, 50 inequality constraints	Objective Value-15.05	7
Constrained Quadratic Programming	Constrained QP 100 variables, 50 equality constraints, 130 inequality constraints	Objective Value-12.73	7
Constrained Quadratic Programming	Constrained QP 100 variables, 50 equality constraints, 150 inequality constraints	Objective Value-12.67	7

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord