Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MultiAPI Spoof: A Multi-API Dataset and Local-Attention Network for Speech Anti-spoofing Detection

About

Existing speech anti-spoofing benchmarks rely on a narrow set of public models, creating a substantial gap from real-world scenarios in which commercial systems employ diverse, often proprietary APIs. To address this issue, we introduce MultiAPI Spoof, a multi-API audio anti-spoofing dataset comprising about 230 hours of synthetic speech generated by 30 distinct APIs, including commercial services, open-source models, and online platforms. Based on this dataset, we define the API tracing task, enabling fine-grained attribution of spoofed audio to its generation source. We further propose Nes2Net-LA, a local-attention enhanced variant of Nes2Net that improves local context modeling and fine-grained spoofing feature extraction. Experiments show that Nes2Net-LA achieves state-of-the-art performance and offers superior robustness, particularly under diverse and unseen spoofing conditions. Code \footnote{https://github.com/XuepingZhang/MultiAPI-Spoof} and dataset \footnote{https://xuepingzhang.github.io/MultiAPI-Spoof-Dataset/} have released.

Xueping Zhang, Zhenshan Zhang, Yechen Wang, Linxi Li, Liwei Jin, Ming Li• 2025

Related benchmarks

TaskDatasetResultRank
Speech Spoofing DetectionIn-the-Wild (ITW) (eval)
EER1.42
19
Anti-spoofingITW (test)
EER0.0142
6
Anti-spoofingMultiAPI Spoof (test)
Overall EER0.56
6
Anti-spoofingAI4T (test)
EER5.64
6
Speech Anti-SpoofingAI4T
EER5.64
6
API TracingMultiAPI Spoof 1.0 (dev)
Precision95.9
3
API TracingMultiAPI Spoof 1.0 (eval)
Precision0.972
3
Showing 7 of 7 rows

Other info

Follow for update