Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multiplexing Neural Audio Watermarks

About

Audio watermarking is essential for verifying speech authenticity, yet single-watermark schemes often struggle against sophisticated distortions such as neural reconstruction and adversarial attacks. To address this limitation, we introduce a multiplexing paradigm that combines multiple watermarking techniques to leverage their inherent complementarities. We explore both parallel and sequential multiplexing strategies and propose perceptual-adaptive time-frequency multiplexing (PA-TFM), a robust training-free approach. To further enhance performance, we introduce MaskNet, a novel model-based framework designed to learn effective time-domain multiplexing. Experimental results on the LibriSpeech and Common Voice datasets under 14 diverse attack types, including high-strength white-box and neural reconstruction attacks, demonstrate that both PA-TFM and MaskNet considerably outperform existing single-watermark baselines, establishing a resilient paradigm for real-world audio protection.

Zheqi Yuan, Yucheng Huang, Guangzhi Sun, Zengrui Jin, Chao Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Audio WatermarkingAudio Robustness Benchmark averaged across 14 attacks
PESQ4.48
11
Audio Watermarking RobustnessLibriSpeech and Common Voice (test)
No Attack Robustness100
10
Showing 2 of 2 rows

Other info

Follow for update