Autoregressive Perturbations for Data Poisoning

About

The prevalence of data scraping from social media as a means to obtain datasets has led to growing concerns regarding unauthorized use of data. Data poisoning attacks have been proposed as a bulwark against scraping, as they make data "unlearnable" by adding small, imperceptible perturbations. Unfortunately, existing methods require knowledge of both the target architecture and the complete dataset so that a surrogate network can be trained, the parameters of which are used to generate the attack. In this work, we introduce autoregressive (AR) poisoning, a method that can generate poisoned data without access to the broader dataset. The proposed AR perturbations are generic, can be applied across different datasets, and can poison different architectures. Compared to existing unlearnable methods, our AR poisons are more resistant against common defenses such as adversarial training and strong data augmentations. Our analysis further provides insight into what makes an effective data poison.

Pedro Sandoval-Segura, Vasu Singla, Jonas Geiping, Micah Goldblum, Tom Goldstein, David W. Jacobs• 2022

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10 (test)	Accuracy88.8	3381
Semantic segmentation	ADE20K (val)	mIoU43.9	3069
Image Classification	Tiny ImageNet (test)	Accuracy68.15	722
Image Classification	CIFAR-100 (test)	Top-1 Accuracy63.48	395
Semantic segmentation	Cityscapes (val)	mIoU68.9	301
Panoptic Segmentation	Cityscapes (val)	PQ51.6	288
Instance Segmentation	Cityscapes (val)	AP35.5	247
Panoptic Segmentation	ADE20K (val)	PQ37.8	99
Image Classification	CIFAR-10 (test)	Clean Accuracy66.91	40
Image Classification	CIFAR-100 (test)	Clean Accuracy (CIFAR-100 Test)63.77	40

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord