Backdoor Attack in the Physical World
About
A backdoor attack aims to inject a hidden backdoor into deep neural networks (DNNs), such that the predictions of the infected model are maliciously changed whenever the hidden backdoor is activated by an attacker-defined trigger. Currently, most existing backdoor attacks adopt the static-trigger setting, i.e., triggers across the training and testing images share the same appearance and are located in the same area. In this paper, we revisit this attack paradigm by analyzing the characteristics of the trigger. We demonstrate that this paradigm is vulnerable when the trigger in testing images is not consistent with the one used for training. As such, those attacks are far less effective in the physical world, where the location and appearance of the trigger in the digitized image may differ from those of the trigger used for training. Moreover, we also discuss how to alleviate this vulnerability. We hope this work inspires further exploration of backdoor properties, to help the design of more advanced backdoor attack and defense methods.
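To make the static-trigger paradigm concrete, below is a minimal NumPy sketch of patch-based data poisoning. It is not the authors' implementation: the function names (`stamp_trigger`, `poison_static`, `poison_random_location`), the white-square trigger, and the 5% poison rate are illustrative assumptions. `poison_static` stamps the trigger at one fixed corner for every poisoned sample, which is the setting shown to be fragile under physical-world shifts; `poison_random_location` instead randomizes the trigger location per sample, one transformation-based way, in line with the paper's discussion, to make the backdoor tolerate spatial inconsistency between training and testing triggers.

```python
import numpy as np

def stamp_trigger(image, trigger, top, left):
    """Paste a trigger patch onto a copy of an H x W x C image at (top, left)."""
    h, w = trigger.shape[:2]
    poisoned = image.copy()
    poisoned[top:top + h, left:left + w] = trigger
    return poisoned

def poison_static(images, labels, trigger, target_label, poison_rate=0.05, seed=0):
    """Static-trigger poisoning: the trigger always sits at the SAME
    location (bottom-right corner) in every poisoned training image."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(images), size=int(poison_rate * len(images)), replace=False)
    H, W = images.shape[1:3]
    h, w = trigger.shape[:2]
    images, labels = images.copy(), labels.copy()
    for i in idx:
        images[i] = stamp_trigger(images[i], trigger, H - h, W - w)
        labels[i] = target_label  # relabel poisoned samples to the attack target
    return images, labels

def poison_random_location(images, labels, trigger, target_label, poison_rate=0.05, seed=0):
    """Mitigation sketch: randomize the trigger location per poisoned sample,
    so the backdoor survives the spatial shifts that occur when a physical
    trigger is re-photographed and digitized."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(images), size=int(poison_rate * len(images)), replace=False)
    H, W = images.shape[1:3]
    h, w = trigger.shape[:2]
    images, labels = images.copy(), labels.copy()
    for i in idx:
        top = rng.integers(0, H - h + 1)
        left = rng.integers(0, W - w + 1)
        images[i] = stamp_trigger(images[i], trigger, top, left)
        labels[i] = target_label
    return images, labels

# Toy usage on CIFAR-10-shaped data with a 3 x 3 white-square trigger.
images = np.zeros((100, 32, 32, 3), dtype=np.uint8)
labels = np.zeros(100, dtype=np.int64)
trigger = np.full((3, 3, 3), 255, dtype=np.uint8)
poisoned_images, poisoned_labels = poison_random_location(images, labels, trigger, target_label=1)
```

Training on the output of `poison_static` reproduces the fragile fixed-location setting analyzed in the paper, while training on the output of `poison_random_location` is one hedged illustration of how transformation-based poisoning can reduce the mismatch between training-time and test-time triggers.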
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Backdoor Defense | CIFAR-10 | Attack Success Rate | 49.5 | 78 |
| Backdoor Defense | GTSRB | PA | 0.051 | 21 |
| Image Classification | GTSRB 32 x 32, 43 classes (test) | Accuracy (CA) | 97.41 | 17 |
| Image Classification | CIFAR-10 32 x 32 (test) | CA | 82.84 | 17 |
| Image Classification | Imagenette 256 x 256, 10 classes (test) | Classification Accuracy | 90.21 | 17 |
| Backdoor Attack | Backdoor Attack Evaluation Summary | Poison Rate | 0.5 | 10 |
| Backdoor Defense | Fashion-MNIST | Clean Accuracy | 17.5 | 8 |