Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective
About
Dataset distillation offers a potential means to enhance data efficiency in deep learning. Recent studies have shown its ability to counteract backdoor risks present in original training samples. In this study, we delve into the theoretical aspects of backdoor attacks and dataset distillation based on kernel methods. We introduce two new theory-driven trigger pattern generation methods specialized for dataset distillation. Following a comprehensive set of analyses and experiments, we show that our optimization-based trigger design framework informs effective backdoor attacks on dataset distillation. Notably, datasets poisoned by our designed trigger prove resilient against conventional backdoor attack detection and mitigation methods. Our empirical results validate that the triggers developed using our approaches are proficient at executing resilient backdoor attacks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Backdoor Attack | FMNIST | ASR100 | 75 | |
| Backdoor Attack | CIFAR10 | Attack Success Rate100 | 70 | |
| Backdoor Attack in Dataset Condensation | CIFAR10 | Clean Trigger Accuracy (CTA)72.7 | 43 | |
| Backdoor Attack in Dataset Condensation | FMNIST | CTA88 | 43 | |
| Backdoor Attack in Dataset Condensation | Tiny-ImageNet | CTA50.9 | 43 | |
| Backdoor Attack in Dataset Condensation | SVHN | CTA88 | 43 | |
| Backdoor Attack Stealthiness Evaluation | CIFAR10 | SSIM0.69 | 40 | |
| Backdoor Attack in Dataset Condensation | STL10 | CTA72.38 | 31 | |
| Backdoor Attack | SVHN | Attack Success Rate100 | 27 | |
| Backdoor Attack | STL10 | Attack Success Rate (ASR)100 | 21 |