HoliAntiSpoof: Audio LLM for Holistic Speech Anti-Spoofing

About

Recent advances in speech synthesis and editing have made speech spoofing increasingly challenging. However, most existing methods treat spoofing as binary classification, overlooking that diverse spoofing techniques manipulate multiple, coupled speech attributes and their semantic effects. In this paper, we introduce HoliAntiSpoof, the first audio large language model (ALLM) framework for holistic speech anti-spoofing analysis. HoliAntiSpoof reformulates spoofing analysis as a unified text generation task, enabling joint reasoning over spoofing methods, affected speech attributes, and their semantic impacts. To support semantic-level analysis, we introduce DailyTalkEdit, a new anti-spoofing benchmark that simulates realistic conversational manipulations and provides annotations of semantic influence. Extensive experiments demonstrate that HoliAntiSpoof outperforms conventional baselines across multiple settings, while preliminary results show that in-context learning further improves out-of-domain generalization. These findings indicate that ALLMs not only enhance speech spoofing detection performance but also enable interpretable analysis of spoofing behaviors and their semantic effects, pointing towards more trustworthy and explainable speech security. Data and code are publicly available.

Xuenan Xu, Yiming Ren, Liwei Liu, Wen Wu, Baoxiang Li, Chaochao Lu, Shuai Wang, Chao Zhang• 2026

Related benchmarks

Task	Dataset	Result
Authenticity Classification	Mixed In-Domain	Accuracy96.16	11
Authenticity Classification	SF-MD Out Domain	Accuracy97.89	11
Authenticity Classification	SpoofCeleb Out Domain	Accuracy90.36	11
Authenticity Classification	HAD Out Domain	Accuracy90.19	11
Speech Anti-Spoofing	Mixed In-Domain	EER0.0306	11
Speech Anti-Spoofing	SpeechFake MultiLingual SF-MD	EER0.39	11
Speech Anti-Spoofing	SpoofCeleb	EER8.52	11
Spoofing-aware speaker verification	ASVspoof 2019 (eval)	EER0.011	11
Spoofing Method Identification	Mixed In-Domain	Accuracy95.12	11
Authenticity Classification	ASV19 In Domain	Accuracy96.59	10

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord