Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UnMarker: A Universal Attack on Defensive Image Watermarking

About

Reports regarding the misuse of Generative AI (GenAI) to create deepfakes are frequent. Defensive watermarking enables GenAI providers to hide fingerprints in their images and use them later for deepfake detection. Yet, its potential has not been fully explored. We present UnMarker -- the first practical universal attack on defensive watermarking. Unlike existing attacks, UnMarker requires no detector feedback, no unrealistic knowledge of the watermarking scheme or similar models, and no advanced denoising pipelines that may not be available. Instead, being the product of an in-depth analysis of the watermarking paradigm revealing that robust schemes must construct their watermarks in the spectral amplitudes, UnMarker employs two novel adversarial optimizations to disrupt the spectra of watermarked images, erasing the watermarks. Evaluations against SOTA schemes prove UnMarker's effectiveness. It not only defeats traditional schemes while retaining superior quality compared to existing attacks but also breaks semantic watermarks that alter an image's structure, reducing the best detection rate to $43\%$ and rendering them useless. To our knowledge, UnMarker is the first practical attack on semantic watermarks, which have been deemed the future of defensive watermarking. Our findings show that defensive watermarking is not a viable defense against deepfakes, and we urge the community to explore alternatives.

Andre Kassis, Urs Hengartner• 2024

Related benchmarks

TaskDatasetResultRank
Watermark VerificationDiffusionDB (test)
TPR@1%FPR3.1
15
Quality PreservationDiffusionDB (test)
FID50.69
13
Quality PreservationMS-COCO (test)
FID49.85
13
Quality PreservationSD-Prompts (test)
FID55.48
13
Watermark Removal AttackSS in-processing watermarking scheme
Bit Accuracy56.38
13
Watermark RemovalDiffusionDB-2M
LPIPS0.614
9
Watermark VerificationMS-COCO 2017 (test)
TPR @ 1% FPR0.032
9
Watermark Removal AttackSS Watermarking Scheme
PSNR21.86
9
Watermark Removal AttackHiDDeN Watermarking Scheme
PSNR23.54
9
Watermark Removal AttackYu Watermarking Scheme
PSNR17.77
9
Showing 10 of 17 rows

Other info

Follow for update