Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection

About

Advances in AIGC technologies have enabled the synthesis of highly realistic audio deepfakes capable of deceiving human auditory perception. Although numerous audio deepfake detection (ADD) methods have been developed, most rely on local temporal/spectral features or pairwise relations, overlooking high-order interactions (HOIs). HOIs capture discriminative patterns that emerge from multiple feature components beyond their individual contributions. We propose HyperPotter, a hypergraph-based framework that explicitly models these synergistic HOIs through clustering-based hyperedges with class-aware prototype initialization. Extensive experiments demonstrate that HyperPotter surpasses its baseline by an average relative gain of 22.15% across 11 datasets and outperforms state-of-the-art methods by 13.96% on 4 challenging cross-domain datasets, demonstrating superior generalization to diverse attacks and speakers.

Qing Wen, Haohao Li, Zhongjie Ba, Peng Cheng, Miao He, Li Lu, Kui Ren• 2026

Related benchmarks

TaskDatasetResultRank
Audio Deepfake Detectionin the wild
EER5.72
58
Audio Deepfake DetectionASVspoof DF 2021
EER1.78
35
Audio Deepfake DetectionASVspoof LA 2021
EER3.94
12
Audio Deepfake DetectionASVspoof LA 2019
EER23
11
Audio Deepfake DetectionASVspoof 5
EER16.04
9
Audio Deepfake DetectionASVspoof DF 2021
F1 Score75.4
7
Audio Deepfake DetectionFoR
F1 Score96.1
7
Audio Deepfake DetectionADD Track 3 2022
F1 Score72
7
Audio Deepfake DetectionADD Track 3 2022
EER11.31
7
Audio Deepfake DetectionCodecFake
F1 Score53.7
7
Showing 10 of 22 rows

Other info

Follow for update