Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Efficient Neural Network Robustness Certification with General Activation Functions

About

Finding minimum distortion of adversarial examples and thus certifying robustness in neural network classifiers for given data points is known to be a challenging problem. Nevertheless, recently it has been shown to be possible to give a non-trivial certified lower bound of minimum adversarial distortion, and some recent progress has been made towards this direction by exploiting the piece-wise linear nature of ReLU activations. However, a generic robustness certification for general activation functions still remains largely unexplored. To address this issue, in this paper we introduce CROWN, a general framework to certify robustness of neural networks with general activation functions for given input data points. The novelty in our algorithm consists of bounding a given activation function with linear and quadratic functions, hence allowing it to tackle general activation functions including but not limited to four popular choices: ReLU, tanh, sigmoid and arctan. In addition, we facilitate the search for a tighter certified lower bound by adaptively selecting appropriate surrogates for each neuron activation. Experimental results show that CROWN on ReLU networks can notably improve the certified lower bounds compared to the current state-of-the-art algorithm Fast-Lin, while having comparable computational efficiency. Furthermore, CROWN also demonstrates its effectiveness and flexibility on networks with general activation functions, including tanh, sigmoid and arctan.

Huan Zhang, Tsui-Wei Weng, Pin-Yu Chen, Cho-Jui Hsieh, Luca Daniel• 2018

Related benchmarks

TaskDatasetResultRank
Robustness CertificationAttention blocks
Certification Rate12.5
12
Neural Network Verificationoval VNN-COMP 2022
Verification Time (s)23.26
10
Robustness CertificationMNIST Binary
Certified Rate84
10
Image ClassificationFashion MNIST
Clean Accuracy70.8
8
Robustness VerificationCIFAR
Verified Accuracy0.455
8
Neural Network Verificationcifar100-tinyimagenet VNN-COMP 2022
Verification Time (s)11.95
6
Neural Network VerificationSmall-attention experiments
Time per Trial (s)0.1
6
Robustness CertificationResidual-MHA blocks
Certification Score31.3
6
Robustness VerificationMNIST
Verified Accuracy71
6
Robustness VerificationCIFAR-10
Verified Accuracy83.33
6
Showing 10 of 18 rows

Other info

Follow for update