Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

About

Large vision-language models (LVLMs) have demonstrated remarkable image understanding and dialogue capabilities, allowing them to handle a variety of visual question answering tasks. However, their widespread availability raises concerns about unauthorized usage and copyright infringement, where users or individuals can develop their own LVLMs by fine-tuning published models. In this paper, we propose a novel method called Parameter Learning Attack (PLA) for tracking the copyright of LVLMs without modifying the original model. Specifically, we construct adversarial images through targeted attacks against the original model, enabling it to generate specific outputs. To ensure these attacks remain effective on potential fine-tuned models to trigger copyright tracking, we allow the original model to learn the trigger images by updating parameters in the opposite direction during the adversarial attack process. Notably, the proposed method can be applied after the release of the original model, thus not affecting the model's performance and behavior. To simulate real-world applications, we fine-tune the original model using various strategies across diverse datasets, creating a range of models for copyright verification. Extensive experiments demonstrate that our method can more effectively identify the original copyright of fine-tuned models compared to baseline methods. Therefore, this work provides a powerful tool for tracking copyrights and detecting unlicensed usage of LVLMs.
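The alternating update described in the abstract can be illustrated with a toy sketch. This is only one reading of the abstract, not the authors' implementation. A linear softmax classifier stands in for the LVLM. The function names (`pla_sketch`, `loss_and_grads`), the sign-gradient image step, the L-infinity budget `eps`, and the step sizes `alpha` and `beta` are all assumptions made for illustration. The image descends the targeted loss while a temporary copy of the parameters ascends the same loss, so the released model itself is never modified.

```python
import numpy as np

def softmax(z):
    z = z - z.max()              # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def loss_and_grads(W, x, target):
    """Cross-entropy toward `target` for a toy linear model with logits W @ x."""
    p = softmax(W @ x)
    loss = -np.log(p[target] + 1e-12)
    dz = p.copy()
    dz[target] -= 1.0            # gradient of the loss w.r.t. the logits
    grad_x = W.T @ dz            # gradient w.r.t. the input "image"
    grad_W = np.outer(dz, x)     # gradient w.r.t. the parameters
    return loss, grad_x, grad_W

def pla_sketch(W_orig, x_clean, target,
               eps=0.5, alpha=0.05, beta=0.001, steps=50):
    """Hypothetical sketch: build a trigger while a model copy resists it."""
    W = W_orig.copy()            # temporary copy; W_orig is left untouched
    x = x_clean.copy()
    for _ in range(steps):
        _, grad_x, grad_W = loss_and_grads(W, x, target)
        # Image step: descend the targeted loss (signed-gradient update).
        x = x - alpha * np.sign(grad_x)
        # Keep the perturbation inside the assumed L-infinity budget
        # and inside the valid pixel range [0, 1].
        x = np.clip(x, x_clean - eps, x_clean + eps)
        x = np.clip(x, 0.0, 1.0)
        # Parameter step: move the copy in the opposite direction (ascent),
        # so the trigger has to succeed against a model that resists it.
        W = W + beta * grad_W
    return x

# Toy usage: two "classes", two "pixels".
W = np.eye(2)
x_clean = np.array([0.2, 0.8])
trigger = pla_sketch(W, x_clean, target=0)
print(np.argmax(W @ x_clean), np.argmax(W @ trigger))
```

In this toy setting the trigger is checked against the unmodified `W`. In the paper's setting, the check would instead be whether fine-tuned descendants of the released model still produce the target output. A linear toy model cannot demonstrate that property.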

Yubo Wang, Jianting Tang, Chaohu Liu, Linli Xu • 2025

Related benchmarks

Task                Dataset                          Result (ASR)  Rank
Copyright tracking  V7W                              51            13
Copyright tracking  TextVQA                          45            13
Copyright tracking  ST-VQA                           53            13
Copyright tracking  PaintingF                        38            8
Copyright tracking  MathV                            45            8
Copyright tracking  V7W subsets of 28k               48            8
Copyright tracking  ST-VQA full (train)              68            8
Copyright tracking  TextVQA (train)                  33            8
Copyright tracking  MathV360k subsets of 50k         0.6           8
Copyright tracking  PaintingForm (subsets of 20k)    76            8
Showing 10 of 17 rows
