Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks
About
It is a highly desirable property for deep networks to be robust against small input changes. One popular way to achieve this property is by designing networks with a small Lipschitz constant. In this work, we propose a new technique for constructing such Lipschitz networks that has a number of desirable properties: it can be applied to any linear network layer (fully-connected or convolutional), it provides formal guarantees on the Lipschitz constant, it is easy to implement and efficient to run, and it can be combined with any training objective and optimization method. In fact, our technique is the first one in the literature that achieves all of these properties simultaneously. Our main contribution is a rescaling-based weight matrix parametrization that guarantees each network layer to have a Lipschitz constant of at most 1 and results in the learned weight matrices to be close to orthogonal. Hence we call such layers almost-orthogonal Lipschitz (AOL). Experiments and ablation studies in the context of image classification with certified robust accuracy confirm that AOL layers achieve results that are on par with most existing methods. Yet, they are simpler to implement and more broadly applicable, because they do not require computationally expensive matrix orthogonalization or inversion steps as part of the network architecture. We provide code at https://github.com/berndprach/AOL.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Certified Accuracy | Mean Certified Accuracy (72/255) (test) | Wins A66 | 15 | |
| Certified Robustness | Mean Certified Accuracy 36/255 | Wins A60 | 15 | |
| Certified Robustness | Mean Certified Accuracy 108/255 | Wins A65 | 15 | |
| Certified Accuracy | Mean Certified Accuracy 255/255 | Total Samples (N)121 | 15 | |
| Mean Accuracy | 121 benchmarks tasks | Wins A60 | 15 | |
| Image Classification | CIFAR10 (test) | Accuracy0.716 | 11 | |
| Image Classification | CIFAR-100 (test) | Clean Accuracy0.437 | 11 | |
| Image Classification | CIFAR-10 (test) | Clean Accuracy71.6 | 10 | |
| Image Classification | CIFAR100 (test) | Natural Accuracy0.437 | 9 | |
| Mean Certified Accuracy | UCI data sets (test) | Average Rank3.417 | 6 |