HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection

About

Advances in AIGC technologies have enabled the synthesis of highly realistic audio deepfakes capable of deceiving human auditory perception. Although numerous audio deepfake detection (ADD) methods have been developed, most rely on local temporal/spectral features or pairwise relations, overlooking high-order interactions (HOIs). HOIs capture discriminative patterns that emerge from multiple feature components beyond their individual contributions. We propose HyperPotter, a hypergraph-based framework designed to capture high-order relations associated with synergistic patterns through clustering-based hyperedges with class-aware prototype initialization. Extensive experiments on 13 test sets show that HyperPotter improves over the baseline on 11 sets, yielding an average relative EER reduction of 12.68% across all test sets and 22.15% on the improved sets. These results demonstrate strong cross-scenario generalization, while also revealing robustness limits under severe codec or channel distortion.
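To make the two averages quoted above concrete, here is a minimal sketch of how a relative EER reduction can be averaged over all test sets and over only the improved ones. It assumes the per-set relative reduction is (baseline EER - model EER) / baseline EER and that the averages are plain means over test sets; the abstract does not spell out the exact formula. The function names and the EER pairs are hypothetical and are not results from the paper.

```python
def relative_eer_reduction(baseline_eer, model_eer):
    """Relative EER reduction in percent; positive means the model improved."""
    if baseline_eer <= 0:
        raise ValueError("baseline EER must be positive")
    return 100.0 * (baseline_eer - model_eer) / baseline_eer


def summarize(pairs):
    """Average the per-set reductions over all sets and over improved sets only.

    `pairs` is a list of (baseline_eer, model_eer) tuples, one per test set.
    Returns (average over all sets, average over improved sets, number improved).
    """
    reductions = [relative_eer_reduction(b, m) for b, m in pairs]
    improved = [r for r in reductions if r > 0]
    avg_all = sum(reductions) / len(reductions)
    avg_improved = sum(improved) / len(improved) if improved else 0.0
    return avg_all, avg_improved, len(improved)


# Hypothetical (baseline, model) EER pairs, purely illustrative.
example_pairs = [(10.0, 8.0), (5.0, 4.0), (20.0, 22.0)]
avg_all, avg_improved, n_improved = summarize(example_pairs)
print(avg_all, avg_improved, n_improved)
```

Under this assumption, test sets where the model is worse contribute negative reductions, which is why the average over all sets is lower than the average over the improved sets.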

Qing Wen, Haohao Li, Zhongjie Ba, Peng Cheng, Miao He, Li Lu, Kui Ren • 2026

Related benchmarks

Task	Dataset	Result
Audio Deepfake Detection	ASVspoof DF 2021	EER 1.78	87
Audio Deepfake Detection	in the wild	EER 5.72	76
Audio Deepfake Detection	CodecFake	EER 34.47	50
Audio Deepfake Detection	ASVspoof LA 2019	EER 23	38
Audio Deepfake Detection	FoR	EER 3.89	28
Audio Deepfake Detection	ADD Track 3 2022	EER 11.31	19
Audio Deepfake Detection	ADD 2023 R2	EER 21.84	19
Audio Deepfake Detection	ADD 2023 R1	EER 21.49	19
Audio Deepfake Detection	ADD Track 1 2022	EER 32.34	19
Audio Deepfake Detection	SONAR	EER 27.71	19

Showing 10 of 22 rows
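
The results above are reported as EER (equal error rate), the operating point where the false acceptance rate and the false rejection rate are equal. The following is a minimal sketch of one common way to estimate it from detector scores, assuming higher scores mean "more likely bona fide" and taking the candidate threshold where the two rates are closest. The function name and the scores are hypothetical; this is not the evaluation code behind the table.

```python
def compute_eer(bonafide_scores, spoof_scores):
    """Estimate the EER in percent from two non-empty lists of scores.

    Assumes higher scores indicate bona fide audio. Each observed score is
    tried as a threshold, and the one where FAR and FRR are closest is used.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best_gap, eer = None, None
    for t in thresholds:
        # FRR: fraction of bona fide samples rejected (score below threshold).
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # FAR: fraction of spoofed samples accepted (score at or above threshold).
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, 100.0 * (far + frr) / 2.0
    return eer


# Hypothetical scores, purely illustrative.
bona = [0.9, 0.8, 0.3, 0.7]
spoof = [0.1, 0.2, 0.4, 0.6]
print(compute_eer(bona, spoof))
```

With finite data the two rates rarely cross exactly, so averaging FAR and FRR at the closest threshold is one simple convention; interpolating along the ROC curve is another.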
