Multi-scale Diffusion Denoised Smoothing
About
Along with recent diffusion models, randomized smoothing has become one of a few tangible approaches that offers adversarial robustness to models at scale, e.g., those of large pre-trained models. Specifically, one can perform randomized smoothing on any classifier via a simple "denoise-and-classify" pipeline, so-called denoised smoothing, given that an accurate denoiser is available - such as diffusion model. In this paper, we present scalable methods to address the current trade-off between certified robustness and accuracy in denoised smoothing. Our key idea is to "selectively" apply smoothing among multiple noise scales, coined multi-scale smoothing, which can be efficiently implemented with a single diffusion model. This approach also suggests a new objective to compare the collective robustness of multi-scale smoothed classifiers, and questions which representation of diffusion model would maximize the objective. To address this, we propose to further fine-tune diffusion model (a) to perform consistent denoising whenever the original image is recoverable, but (b) to generate rather diverse outputs otherwise. Our experiments show that the proposed multi-scale smoothing scheme combined with diffusion fine-tuning enables strong certified robustness available with high noise level while maintaining its accuracy close to non-smoothed classifiers.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet A | Top-1 Acc41.7 | 553 | |
| Image Classification | ImageNet-R | Top-1 Acc69.8 | 474 | |
| Image Classification | CIFAR-10 corrupted (test) | Acc89.4 | 30 | |
| Certified Robustness | CIFAR-10 (test) | Accuracy (Standard)94.5 | 26 | |
| Image Classification | CIFAR-10.1 1.0 (test) | Accuracy78.5 | 14 | |
| Certified Accuracy | CIFAR-10 (test) | Certified Accuracy (r=0.0)76.51 | 9 | |
| Image Classification | ImageNet IN-1K (val) | Empirical Accuracy83.8 | 7 | |
| Certified Accuracy | ImageNet (val) | Certified Accuracy (Radius 0.0)72.5 | 7 | |
| Certified Robustness | CIFAR-10 | Certified Acc (eps=0.0)90.2 | 6 |