Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Probabilistic Margins for Instance Reweighting in Adversarial Training

About

Reweighting adversarial data during training has been recently shown to improve adversarial robustness, where data closer to the current decision boundaries are regarded as more critical and given larger weights. However, existing methods measuring the closeness are not very reliable: they are discrete and can take only a few values, and they are path-dependent, i.e., they may change given the same start and end points with different attack paths. In this paper, we propose three types of probabilistic margin (PM), which are continuous and path-independent, for measuring the aforementioned closeness and reweighting adversarial data. Specifically, a PM is defined as the difference between two estimated class-posterior probabilities, e.g., such the probability of the true label minus the probability of the most confusing label given some natural data. Though different PMs capture different geometric properties, all three PMs share a negative correlation with the vulnerability of data: data with larger/smaller PMs are safer/riskier and should have smaller/larger weights. Experiments demonstrate that PMs are reliable measurements and PM-based reweighting methods outperform state-of-the-art methods.

Qizhou Wang, Feng Liu, Bo Han, Tongliang Liu, Chen Gong, Gang Niu, Mingyuan Zhou, Masashi Sugiyama• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100--
622
Image ClassificationStanford Cars
Accuracy80.67
477
Image ClassificationAircraft
Accuracy80.12
302
Image ClassificationImageNet-1K
Accuracy75.23
190
Image ClassificationOxford-IIIT Pet
Accuracy92.18
161
Image ClassificationImageNet-A (test)--
154
Image ClassificationImageNet-100
Accuracy86.34
84
Adversarial RobustnessCIFAR-10 (test)--
76
Adversarial RobustnessCIFAR-100 (test)
Natural Acc60.74
46
Image ClassificationiWILDCam OOD
Accuracy71.4
26
Showing 10 of 13 rows

Other info

Follow for update