WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

About

Digital watermarking is essential for securing generated images from diffusion models. Accurate watermark evaluation is critical for algorithm development, yet existing methods have significant limitations: they lack a unified framework for both residual and semantic watermarks, provide results without interpretability, neglect comprehensive security considerations, and often use inappropriate metrics for semantic watermarks. To address these gaps, we propose WMVLM, the first unified and interpretable evaluation framework for diffusion model image watermarking via vision-language models (VLMs). We redefine quality and security metrics for each watermark type: residual watermarks are evaluated by artifact strength and erasure resistance, while semantic watermarks are assessed through latent distribution shifts. Moreover, we introduce a three-stage training strategy to progressively enable the model to achieve classification, scoring, and interpretable text generation. Experiments show WMVLM outperforms state-of-the-art VLMs with strong generalization across datasets, diffusion models, and watermarking methods.

Zijin Yang, Yu Sun, Kejiang Chen, Jiawei Zhao, Jun Jiang, Weiming Zhang, Nenghai Yu• 2026

Related benchmarks

Task	Dataset	Result
Image Watermark Evaluation	SD DwtDct v1.4	PLCC0.951	12
Image Watermark Evaluation	SD RivaGAN v1.4	PLCC0.81	12
Image Watermark Evaluation	SD HiDDeN v1.4	PLCC0.656	12
Image Watermark Evaluation	SD RW v1.4	PLCC0.849	12
Image Watermark Evaluation	SD VINE v1.4	PLCC0.978	12
Image Watermark Evaluation	SD SS v1.4	PLCC0.368	12
Image Watermark Evaluation	SD RingID v1.4	Quality Accuracy79.2	12
Image Watermark Evaluation	SD Tree-Ring v1.4	Quality Accuracy89	12
Image Watermarking Evaluation	GaussMarker	Quality Accuracy95.6	12
Image Watermarking Evaluation	T2SMark	Quality Accuracy93	12

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord