BUT Systems for WildSpoof Challenge: SASV in the Wild

About

This paper presents the BUT submission to the WildSpoof Challenge, focusing on the Spoofing-robust Automatic Speaker Verification (SASV) track. We propose a SASV framework designed to bridge the gap between general audio understanding and specialized speech analysis. Our subsystem integrates diverse Self-Supervised Learning front-ends ranging from general audio models (e.g., Dasheng) to speech-specific encoders (e.g., WavLM). These representations are aggregated via a lightweight Multi-Head Factorized Attention back-end for corresponding subtasks. Furthermore, we introduce a feature domain augmentation strategy based on Distribution Uncertainty to explicitly model and mitigate the domain shift caused by unseen neural vocoders and recording environments. By fusing these robust CM scores with state-of-the-art ASV systems, our approach achieves superior minimization of the a-DCFs and EERs.

Junyi Peng, Jin Li, Johan Rohdin, Lin Zhang, Miroslav Hlav\'a\v{c}ek, Oldrich Plchot• 2025

Related benchmarks

Task	Dataset	Result
Speaker Verification	VoxCeleb1 (Vox1-O)	EER22.9	160
Spoofing-aware speaker verification	SpoofCeleb (eval set)	--	17
Fake Detection	ASVspoof5 (dev)	EER1.193	16
Speaker Verification	VoxCeleb Extended 1	--	15
Speaker Verification	VoxCeleb Hard 1	--	15
Anti-spoofing	SpoofCeleb (dev)	EER (%)21.3	9
Anti-spoofing	SpoofCeleb (Eval)	EER0.078	9
Automatic Speaker Verification	SpoofCeleb (dev)	EER2.441	3

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord