Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

About

Digital watermarking is essential for securing generated images from diffusion models. Accurate watermark evaluation is critical for algorithm development, yet existing methods have significant limitations: they lack a unified framework for both residual and semantic watermarks, provide results without interpretability, neglect comprehensive security considerations, and often use inappropriate metrics for semantic watermarks. To address these gaps, we propose WMVLM, the first unified and interpretable evaluation framework for diffusion model image watermarking via vision-language models (VLMs). We redefine quality and security metrics for each watermark type: residual watermarks are evaluated by artifact strength and erasure resistance, while semantic watermarks are assessed through latent distribution shifts. Moreover, we introduce a three-stage training strategy to progressively enable the model to achieve classification, scoring, and interpretable text generation. Experiments show WMVLM outperforms state-of-the-art VLMs with strong generalization across datasets, diffusion models, and watermarking methods.

Zijin Yang, Yu Sun, Kejiang Chen, Jiawei Zhao, Jun Jiang, Weiming Zhang, Nenghai Yu• 2026

Related benchmarks

TaskDatasetResultRank
Image Watermark EvaluationSD DwtDct v1.4
PLCC0.951
12
Image Watermark EvaluationSD RivaGAN v1.4
PLCC0.81
12
Image Watermark EvaluationSD HiDDeN v1.4
PLCC0.656
12
Image Watermark EvaluationSD RW v1.4
PLCC0.849
12
Image Watermark EvaluationSD VINE v1.4
PLCC0.978
12
Image Watermark EvaluationSD SS v1.4
PLCC0.368
12
Image Watermark EvaluationSD RingID v1.4
Quality Accuracy79.2
12
Image Watermark EvaluationSD Tree-Ring v1.4
Quality Accuracy89
12
Image Watermarking EvaluationGaussMarker
Quality Accuracy95.6
12
Image Watermarking EvaluationT2SMark
Quality Accuracy93
12
Showing 10 of 15 rows

Other info

Follow for update