Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors

About

With the rise of AI-generated audio, watermarking has become widely used for detecting misuse and protecting intellectual property. However, adversaries may try to remove these watermarks, making it critical to evaluate how well watermarking schemes withstand removal attacks. Existing attacks are often impractical: they either noticeably degrade perceptual quality or require access to the watermarking scheme. We propose DiffErase, a black-box watermark removal attack that assumes no knowledge of the target watermarking scheme while maintaining perceptual quality. DiffErase perturbs watermarked audio to an intermediate diffusion noise level and regenerates it using a pretrained denoising model, effectively suppressing watermark signals. Theoretical analysis and extensive experiments demonstrate that inaudible audio watermarks are highly vulnerable: across multiple audio domains, DiffErase consistently removes watermarks while preserving perceptual quality. These findings highlight the need for future audio watermarking designs to consider diffusion-based threats. Code and demos are available at https://differase.github.io/DiffErase/.

Lingfeng Yao, Xincong Zhong, Chenpei Huang, Xuandong Zhao, Hanqing Guo, Aohan Li, Jiang Liu, Tomoaki Ohtsuki, Miao Pan• 2026

Related benchmarks

Task	Dataset	Result
Watermark Removal	LibriSpeech speech domain official releases (test)	SQUIM-MOS4.423	10
Audio Watermark Removal	FMA small	ViSQOL Score4.163	10
Audio Quality Assessment	Clotho 1.0 (test)	ViSQOL3.952	10
Watermark Detection	Clotho 1.0 (test)	Perth23	10
Audio Watermark Removal	Speech domain	MUSHRA Score (Before Attack)98.38	5
Audio Watermark Removal	Music domain	MUSHRA (Before Attack)95.62	5
Audio Watermark Removal	Environmental domain	MUSHRA (Before Attack)94.88	5

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord