BppAttack: Stealthy and Efficient Trojan Attacks against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning

About

Deep neural networks are vulnerable to Trojan attacks. Existing attacks use visible patterns (e.g., a patch or image transformations) as triggers, which are vulnerable to human inspection. In this paper, we propose stealthy and efficient Trojan attacks, BppAttack. Based on existing biology literature on human visual systems, we propose to use image quantization and dithering as the Trojan trigger, making imperceptible changes. It is a stealthy and efficient attack without training auxiliary models. Due to the small changes made to images, it is hard to inject such triggers during training. To alleviate this problem, we propose a contrastive learning based approach that leverages adversarial attacks to generate negative sample pairs so that the learned trigger is precise and accurate. The proposed method achieves high attack success rates on four benchmark datasets, including MNIST, CIFAR-10, GTSRB, and CelebA. It also effectively bypasses existing Trojan defenses and human inspection. Our code can be found in https://github.com/RU-System-Software-and-Security/BppAttack.

Zhenting Wang, Juan Zhai, Shiqing Ma• 2022

Related benchmarks

Task	Dataset	Result
Backdoor Defense	Tiny-ImageNet	Accuracy87.48	196
Backdoor Attack	CIFAR10	Attack Success Rate98.3	158
Backdoor Attack	GTSRB	Attack Success Rate97.1	142
Image Classification	GTSRB	CA92.8	121
Backdoor Attack	MNIST (test)	Classification Accuracy (C-Acc)99.58	88
Image Classification	MNIST	Standard Accuracy99.6	54
Image Classification	TinyImageNet	C-Acc87.5	42
Image Classification	CIFAR-10	C-Acc91.6	42
Backdoor Attack	CelebA	Backdoor Attack Rate (BA)77.51	37

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord