Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HoliAntiSpoof: Audio LLM for Holistic Speech Anti-Spoofing

About

Recent advances in speech synthesis and editing have made speech spoofing increasingly challenging. However, most existing methods treat spoofing as binary classification, overlooking that diverse spoofing techniques manipulate multiple, coupled speech attributes and their semantic effects. In this paper, we introduce HoliAntiSpoof, the first audio large language model (ALLM) framework for holistic speech anti-spoofing analysis. HoliAntiSpoof reformulates spoofing analysis as a unified text generation task, enabling joint reasoning over spoofing methods, affected speech attributes, and their semantic impacts. To support semantic-level analysis, we introduce DailyTalkEdit, a new anti-spoofing benchmark that simulates realistic conversational manipulations and provides annotations of semantic influence. Extensive experiments demonstrate that HoliAntiSpoof outperforms conventional baselines across multiple settings, while preliminary results show that in-context learning further improves out-of-domain generalization. These findings indicate that ALLMs not only enhance speech spoofing detection performance but also enable interpretable analysis of spoofing behaviors and their semantic effects, pointing towards more trustworthy and explainable speech security. Data and code are publicly available.

Xuenan Xu, Yiming Ren, Liwei Liu, Wen Wu, Baoxiang Li, Chaochao Lu, Shuai Wang, Chao Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Authenticity ClassificationMixed In-Domain
Accuracy96.16
11
Authenticity ClassificationSF-MD Out Domain
Accuracy97.89
11
Authenticity ClassificationSpoofCeleb Out Domain
Accuracy90.36
11
Authenticity ClassificationHAD Out Domain
Accuracy90.19
11
Speech Anti-SpoofingMixed In-Domain
EER0.0306
11
Speech Anti-SpoofingSpeechFake MultiLingual SF-MD
EER0.39
11
Speech Anti-SpoofingSpoofCeleb
EER8.52
11
Spoofing-aware speaker verificationASVspoof 2019 (eval)
EER0.011
11
Spoofing Method IdentificationMixed In-Domain
Accuracy95.12
11
Authenticity ClassificationASV19 In Domain
Accuracy96.59
10
Showing 10 of 12 rows

Other info

Follow for update