FDA: Feature Disruptive Attack

About

Though Deep Neural Networks (DNN) show excellent performance across various computer vision tasks, several works show their vulnerability to adversarial samples, i.e., image samples with imperceptible noise engineered to manipulate the network's prediction. Adversarial sample generation methods range from simple to complex optimization techniques. Majority of these methods generate adversaries through optimization objectives that are tied to the pre-softmax or softmax output of the network. In this work we, (i) show the drawbacks of such attacks, (ii) propose two new evaluation metrics: Old Label New Rank (OLNR) and New Label Old Rank (NLOR) in order to quantify the extent of damage made by an attack, and (iii) propose a new adversarial attack FDA: Feature Disruptive Attack, to address the drawbacks of existing attacks. FDA works by generating image perturbation that disrupt features at each layer of the network and causes deep-features to be highly corrupt. This allows FDA adversaries to severely reduce the performance of deep networks. We experimentally validate that FDA generates stronger adversaries than other state-of-the-art methods for image classification, even in the presence of various defense measures. More importantly, we show that FDA disrupts feature-representation based tasks even without access to the task-specific network or methodology. Code available at: https://github.com/BardOfCodes/fda

Aditya Ganeshan, B.S. Vivek, R. Venkatesh Babu• 2019

Related benchmarks

Task	Dataset	Result
Image Classification	SVHN (test)	--	199
Image Classification	ImageNet ILSVRC 1000 images 2012 (val)	Attack Success Rate99.6	180
Visual Reasoning	NLVR2	--	49
Image Captioning	MSCOCO (test)	CIDEr55.75	29
Visual Entailment	SNLI-VE	Accuracy0.209	24
REC	RefCOCO+	ASR76.21	16
REC	RefCOCOg	ASR78.64	16
Image Classification	ImageNet-1K	ASR12.31	4

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord