Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing

About

Face Anti-Spoofing (FAS) is crucial for securing face recognition systems against presentation attacks. With advancements in sensor manufacture and multi-modal learning techniques, many multi-modal FAS approaches have emerged. However, they face challenges in generalizing to unseen attacks and deployment conditions. These challenges arise from (1) modality unreliability, where some modality sensors like depth and infrared undergo significant domain shifts in varying environments, leading to the spread of unreliable information during cross-modal feature fusion, and (2) modality imbalance, where training overly relies on a dominant modality hinders the convergence of others, reducing effectiveness against attack types that are indistinguishable sorely using the dominant modality. To address modality unreliability, we propose the Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected regions within each modality and suppress the impact of unreliable regions on other modalities. For modality imbalance, we propose a Rebalanced Modality Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all modalities by adaptively adjusting their gradients. Besides, we provide the first large-scale benchmark for evaluating multi-modal FAS performance under domain generalization scenarios. Extensive experiments demonstrate that our method outperforms state-of-the-art methods. Source code and protocols will be released on https://github.com/OMGGGGG/mmdg.

Xun Lin, Shuai Wang, Rizhao Cai, Yizhong Liu, Ying Fu, Zitong Yu, Wenzhong Tang, Alex Kot• 2024

Related benchmarks

TaskDatasetResultRank
Face Anti-SpoofingCASIA-CeFA (C), PADISI (P), CASIA-SURF (S), and WMCA (W) Protocol 2, Missing D
HTER23.39
49
Face Anti-SpoofingCASIA-CeFA, WMCA, PADISI-USC, CASIA-SURF CW -> PS Protocol 3 (Cross-dataset)
HTER20.12
10
Face Anti-SpoofingPS -> CW (PADISI-USC, CASIA-SURF to CASIA-CeFA, WMCA) Protocol 3 (Cross-dataset)
HTER36.6
10
Multi-modal Face Anti-SpoofingWMCA Protocol 1, CPS -> W 1.0 (test)
HTER (%)12.79
10
Multi-modal Face Anti-SpoofingCASIA-SURF Protocol 1, CPW -> S 1.0 (test)
HTER15.32
10
Multi-modal Face Anti-SpoofingPADISI-Face Protocol 1, CSW -> P 1.0 (test)
HTER (%)18.95
10
Multi-modal Face Anti-SpoofingCASIA-CeFA Protocol 1, PSW -> C 1.0 (test)
HTER29.93
10
Showing 7 of 7 rows

Other info

Code

Follow for update