Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach

About

Recent fine-tuning techniques for diffusion models enable them to reproduce specific image sets, such as particular faces or artistic styles, but also introduce copyright and security risks. Dataset watermarking has been proposed to ensure traceability by embedding imperceptible watermarks into training images, which remain detectable in outputs even after fine-tuning. However, current methods lack a unified evaluation framework. To address this, this paper establishes a general threat model and introduces a comprehensive evaluation framework encompassing Universality, Transmissibility, and Robustness. Experiments show that existing methods perform well in universality and transmissibility, and exhibit some robustness against common image processing operations, yet still fall short under real-world threat scenarios. To reveal these vulnerabilities, the paper further proposes a practical watermark removal method that fully eliminates dataset watermarks without affecting fine-tuning, highlighting a key challenge for future research.

Xincheng Wang, Hanchi Sun, Wenjun Sun, Kejun Xue, Wangqiu Zhou, Jianbo Zhang, Wei Sun, Dandan Zhu, Xiongkuo Min, Jun Jia, Zhijun Fang• 2025

Related benchmarks

TaskDatasetResultRank
Watermark RemovalCelebA-HQ LoRA, w/o te
CLIP-T Score0.2614
24
Text-to-Image GenerationPokémon
CLIP Score0.2931
21
Text-to-Image GenerationWikiArt
CLIP-T0.2581
7
Fine-tuning GenerationCelebA-HQ
CLIP-T Score0.2389
4
Fine-tuning GenerationPokemon
CLIP-T0.2775
4
Text-to-Image GenerationCelebA-HQ
CLIP-T Score0.2589
3
Showing 6 of 6 rows

Other info

Follow for update