Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SilentCipher: Deep Audio Watermarking

About

In the realm of audio watermarking, it is challenging to simultaneously encode imperceptible messages while enhancing the message capacity and robustness. Although recent advancements in deep learning-based methods bolster the message capacity and robustness over traditional methods, the encoded messages introduce audible artefacts that restricts their usage in professional settings. In this study, we introduce three key innovations. Firstly, our work is the first deep learning-based model to integrate psychoacoustic model based thresholding to achieve imperceptible watermarks. Secondly, we introduce psuedo-differentiable compression layers, enhancing the robustness of our watermarking algorithm. Lastly, we introduce a method to eliminate the need for perceptual losses, enabling us to achieve SOTA in both robustness as well as imperceptible watermarking. Our contributions lead us to SilentCipher, a model enabling users to encode messages within audio signals sampled at 44.1kHz.

Mayank Kumar Singh, Naoya Takahashi, Weihsiang Liao, Yuki Mitsufuji• 2024

Related benchmarks

TaskDatasetResultRank
Audio WatermarkingGuitarSet
Survivability Detection Rate100
23
Audio WatermarkingLibriSpeech
Detection Accuracy99.19
23
Audio WatermarkingjaCappella
Survivability Rate46
23
Watermark RobustnessFreischuetz
Survivability97.26
16
Watermark RobustnessAIR
Survivability18.6
16
Audio WatermarkingAudio Robustness Benchmark averaged across 14 attacks
PESQ4.15
11
Audio Watermarking RobustnessLibriSpeech and Common Voice (test)
No Attack Robustness100
10
Audio WatermarkingClotho
Detectability Accuracy96.7
7
Audio WatermarkingPCD
Detection Accuracy96.7
7
Audio WatermarkingMAESTRO
Detectability Accuracy96.7
7
Showing 10 of 12 rows

Other info

Follow for update