Simple Black-box Adversarial Attacks

About

We propose an intriguingly simple method for the construction of adversarial images in the black-box setting. In contrast to the white-box scenario, constructing black-box adversarial images has the additional constraint on query budget, and efficient attacks remain an open problem to date. With only the mild assumption of continuous-valued confidence scores, our highly query-efficient algorithm utilizes the following simple iterative principle: we randomly sample a vector from a predefined orthonormal basis and either add it to or subtract it from the target image. Despite its simplicity, the proposed method can be used for both untargeted and targeted attacks -- resulting in unprecedented query efficiency in both settings. We demonstrate the efficacy and efficiency of our algorithm on several real-world settings including the Google Cloud Vision API. We argue that our proposed algorithm should serve as a strong baseline for future black-box attacks, in particular because it is extremely fast and its implementation requires fewer than 20 lines of PyTorch code.

Chuan Guo, Jacob R. Gardner, Yurong You, Andrew Gordon Wilson, Kilian Q. Weinberger • 2019

Related benchmarks

Task	Dataset	Result
Point Cloud Adversarial Attack	ModelNet40 (test)	ASR 100	123
Untargeted Score-based Black-box Attack	ImageNet	ASR 100	96
Targeted Score-based Black-box Attack	ImageNet	ASR 97.5	96
Targeted Adversarial Attack	CIFAR-10	ASR 73.98	43
Untargeted Adversarial Attack	ImageNet (test)	--	26
Adversarial Attack	AADD-LQ (surrogate)	ASR 0.176	24
Adversarial Attack	AADD-LQ (blind)	ASR 0.4	12
Targeted Score-based Black-box Attack	Food101	ASR 69.7	6
Targeted Score-based Black-box Attack	ObjectNet	ASR 18	6
Untargeted Score-based Black-box Attack	Food101	ASR 97.5	6

Showing 10 of 16 rows
