Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LiBRe: A Practical Bayesian Approach to Adversarial Detection

About

Despite their appealing flexibility, deep neural networks (DNNs) are vulnerable against adversarial examples. Various adversarial defense strategies have been proposed to resolve this problem, but they typically demonstrate restricted practicability owing to unsurmountable compromise on universality, effectiveness, or efficiency. In this work, we propose a more practical approach, Lightweight Bayesian Refinement (LiBRe), in the spirit of leveraging Bayesian neural networks (BNNs) for adversarial detection. Empowered by the task and attack agnostic modeling under Bayes principle, LiBRe can endow a variety of pre-trained task-dependent DNNs with the ability of defending heterogeneous adversarial attacks at a low cost. We develop and integrate advanced learning techniques to make LiBRe appropriate for adversarial detection. Concretely, we build the few-layer deep ensemble variational and adopt the pre-training & fine-tuning workflow to boost the effectiveness and efficiency of LiBRe. We further provide a novel insight to realise adversarial detection-oriented uncertainty quantification without inefficiently crafting adversarial examples during training. Extensive empirical studies covering a wide range of scenarios verify the practicability of LiBRe. We also conduct thorough ablation studies to evidence the superiority of our modeling and learning strategies.

Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu• 2021

Related benchmarks

TaskDatasetResultRank
Adversarial Attack DetectionFace dataset Adv-Mask attack
AUROC99.93
5
Adversarial Attack DetectionFace dataset Adv-Sticker attack
AUROC0.9739
5
Adversarial Attack DetectionFace dataset Adv-Glasses attack
AUROC81.4
5
Adversarial Attack DetectionFace dataset TIPIM attack
AUROC0.5345
5
Adversarial Attack DetectionImageNet100
Robustness (BIM)92.59
5
Adversarial DetectionImageNet MIFGSM attack (test)
AUROC87.25
5
Showing 6 of 6 rows

Other info

Follow for update