Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BUT Systems for WildSpoof Challenge: SASV in the Wild

About

This paper presents the BUT submission to the WildSpoof Challenge, focusing on the Spoofing-robust Automatic Speaker Verification (SASV) track. We propose a SASV framework designed to bridge the gap between general audio understanding and specialized speech analysis. Our subsystem integrates diverse Self-Supervised Learning front-ends ranging from general audio models (e.g., Dasheng) to speech-specific encoders (e.g., WavLM). These representations are aggregated via a lightweight Multi-Head Factorized Attention back-end for corresponding subtasks. Furthermore, we introduce a feature domain augmentation strategy based on Distribution Uncertainty to explicitly model and mitigate the domain shift caused by unseen neural vocoders and recording environments. By fusing these robust CM scores with state-of-the-art ASV systems, our approach achieves superior minimization of the a-DCFs and EERs.

Junyi Peng, Jin Li, Johan Rohdin, Lin Zhang, Miroslav Hlav\'a\v{c}ek, Oldrich Plchot• 2025

Related benchmarks

TaskDatasetResultRank
Speaker VerificationVoxCeleb1 (Vox1-O)
EER22.9
33
Fake DetectionASVspoof5 (dev)
EER1.193
16
Speaker VerificationVoxCeleb Extended 1--
15
Speaker VerificationVoxCeleb Hard 1--
15
Spoofing-aware speaker verificationSpoofCeleb (eval set)--
13
Anti-spoofingSpoofCeleb (dev)
EER (%)21.3
9
Anti-spoofingSpoofCeleb (Eval)
EER0.078
9
Automatic Speaker VerificationSpoofCeleb (dev)
EER2.441
3
Showing 8 of 8 rows

Other info

Follow for update