Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints

About

Evaluating adversarial robustness amounts to finding the minimum perturbation needed to have an input sample misclassified. The inherent complexity of the underlying optimization requires current gradient-based attacks to be carefully tuned, initialized, and possibly executed for many computationally-demanding iterations, even if specialized to a given perturbation model. In this work, we overcome these limitations by proposing a fast minimum-norm (FMN) attack that works with different $\ell_p$-norm perturbation models ($p=0, 1, 2, \infty$), is robust to hyperparameter choices, does not require adversarial starting points, and converges within few lightweight steps. It works by iteratively finding the sample misclassified with maximum confidence within an $\ell_p$-norm constraint of size $\epsilon$, while adapting $\epsilon$ to minimize the distance of the current sample to the decision boundary. Extensive experiments show that FMN significantly outperforms existing attacks in terms of convergence speed and computation time, while reporting comparable or even smaller perturbation sizes.

Maura Pintor, Fabio Roli, Wieland Brendel, Battista Biggio• 2021

Related benchmarks

TaskDatasetResultRank
Adversarial AttackImageNet (test)
Success Rate66.2
101
Adversarial AttackMNIST (test)
Median ||δ||p0.134
21
Adversarial AttackCIFAR10 (test)
Median ||δ||p1.03
6
Adversarial AttackMNIST
Avg Latency (ms)4.33
6
Adversarial AttackCIFAR10 (test)
Median ||δ||p3.04
5
Adversarial AttackCIFAR10
Avg Query Time (ms)26.26
3
Adversarial AttackCIFAR10
Avg Execution Time (ms)25.39
3
Adversarial AttackMNIST
Avg Execution Time (ms/query)5.14
3
Showing 8 of 8 rows

Other info

Code

Follow for update