Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors

About

With the rise of AI-generated audio, watermarking has become widely used for detecting misuse and protecting intellectual property. However, adversaries may try to remove these watermarks, making it critical to evaluate how well watermarking schemes withstand removal attacks. Existing attacks are often impractical: they either noticeably degrade perceptual quality or require access to the watermarking scheme. We propose DiffErase, a black-box watermark removal attack that assumes no knowledge of the target watermarking scheme while maintaining perceptual quality. DiffErase perturbs watermarked audio to an intermediate diffusion noise level and regenerates it using a pretrained denoising model, effectively suppressing watermark signals. Theoretical analysis and extensive experiments demonstrate that inaudible audio watermarks are highly vulnerable: across multiple audio domains, DiffErase consistently removes watermarks while preserving perceptual quality. These findings highlight the need for future audio watermarking designs to consider diffusion-based threats. Code and demos are available at https://differase.github.io/DiffErase/.

Lingfeng Yao, Xincong Zhong, Chenpei Huang, Xuandong Zhao, Hanqing Guo, Aohan Li, Jiang Liu, Tomoaki Ohtsuki, Miao Pan• 2026

Related benchmarks

TaskDatasetResultRank
Watermark RemovalLibriSpeech speech domain official releases (test)
SQUIM-MOS4.423
10
Audio Watermark RemovalFMA small
ViSQOL Score4.163
10
Audio Quality AssessmentClotho 1.0 (test)
ViSQOL3.952
10
Watermark DetectionClotho 1.0 (test)
Perth23
10
Audio Watermark RemovalSpeech domain
MUSHRA Score (Before Attack)98.38
5
Audio Watermark RemovalMusic domain
MUSHRA (Before Attack)95.62
5
Audio Watermark RemovalEnvironmental domain
MUSHRA (Before Attack)94.88
5
Showing 7 of 7 rows

Other info

Follow for update