Better Diffusion Models Further Improve Adversarial Training
About
It has been recognized that the data generated by the denoising diffusion probabilistic model (DDPM) improves adversarial training. After two years of rapid development in diffusion models, a question naturally arises: can better diffusion models further improve adversarial training? This paper gives an affirmative answer by employing the most recent diffusion model which has higher efficiency ($\sim 20$ sampling steps) and image quality (lower FID score) compared with DDPM. Our adversarially trained models achieve state-of-the-art performance on RobustBench using only generated data (no external datasets). Under the $\ell_\infty$-norm threat model with $\epsilon=8/255$, our models achieve $70.69\%$ and $42.67\%$ robust accuracy on CIFAR-10 and CIFAR-100, respectively, i.e. improving upon previous state-of-the-art models by $+4.58\%$ and $+8.03\%$. Under the $\ell_2$-norm threat model with $\epsilon=128/255$, our models achieve $84.86\%$ on CIFAR-10 ($+4.44\%$). These results also beat previous works that use external data. We also provide compelling results on the SVHN and TinyImageNet datasets. Our code is available at https://github.com/wzekai99/DM-Improves-AT.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 (test) | Accuracy (Clean)93.25 | 273 | |
| Adversarial Robustness | CIFAR-10 (test) | -- | 76 | |
| Image Classification | CIFAR-100 (test) | Clean Accuracy75.22 | 61 | |
| Image Classification | CIFAR-10 512-image subset (test) | Clean Accuracy95.9 | 26 | |
| Image Classification | CIFAR-10 (test) | Clean Accuracy95.9 | 19 | |
| Image Classification | CIFAR100 (test) | Natural Accuracy37.25 | 16 | |
| Image Classification | Tiny ImageNet (val) | Clean Accuracy65.2 | 11 | |
| Image Classification | CIFAR-10 8/255 perturbation (test) | Clean Accuracy92.44 | 10 | |
| Adversarial Attack Detection | CIFAR10 l2, epsilon=0.5 (test) | Kendall Tau Correlation0.74 | 10 | |
| Adversarial Robustness | CIFAR-100 l_inf, epsilon=8/255 (test) | Clean Accuracy72.58 | 9 |