Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TARO: Temporal Adversarial Rectification Optimization Using Diffusion Models as Purifiers

About

Adversarial purification with diffusion models seeks to project adversarial examples back toward the data manifold, but balancing semantic preservation and robustness against adaptive attacks remains challenging. Recent work shows that standard diffusion purification can fail under adaptive evaluation, while test-time score-based optimization is more resilient. Existing optimization defenses, however, typically rely on a single diffusion noise regime or treat timesteps uniformly, overlooking the distinct roles of coarse and fine denoising scales. We propose Temporal Adversarial Rectification Optimization (TARO), an inference-time purification method that builds a temporally guided score prior from multiple denoising views along the diffusion trajectory. TARO forms a coarse-to-fine residual target: high-noise experts provide globally smoothed structure with reduced adversarial sensitivity, while low-noise experts restore image-specific, class-relevant details. A guidance strength controls this temporal correction, allowing TARO to balance robust global rectification with semantic preservation. Empirically, TARO improves robust accuracy across datasets and adaptive threat models in a zero-shot setting, while remaining compatible with complementary adversarial-likelihood objectives for further robustness gains.

Daniel Wesego, Pedram Rooshenas• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10
Clean Accuracy93.55
75
Robust Image ClassificationCIFAR-10--
68
Image ClassificationCIFAR-10C
Brightness Acc91.3
28
Image ClassificationCIFAR-10 (test)
Clean Accuracy93.59
12
Image ClassificationCIFAR-100
Clean Accuracy74.72
10
Image ClassificationImagenette
Clean Accuracy81.6
5
Image ClassificationCIFAR-10 (test)
Robust Accuracy90.1
4
Showing 7 of 7 rows

Other info

Follow for update