Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

About

Large vision-language models (LVLMs) have demonstrated remarkable image understanding and dialogue capabilities, allowing them to handle a variety of visual question answering tasks. However, their widespread availability raises concerns about unauthorized usage and copyright infringement, where users or individuals can develop their own LVLMs by fine-tuning published models. In this paper, we propose a novel method called Parameter Learning Attack (PLA) for tracking the copyright of LVLMs without modifying the original model. Specifically, we construct adversarial images through targeted attacks against the original model, enabling it to generate specific outputs. To ensure these attacks remain effective on potential fine-tuned models to trigger copyright tracking, we allow the original model to learn the trigger images by updating parameters in the opposite direction during the adversarial attack process. Notably, the proposed method can be applied after the release of the original model, thus not affecting the model's performance and behavior. To simulate real-world applications, we fine-tune the original model using various strategies across diverse datasets, creating a range of models for copyright verification. Extensive experiments demonstrate that our method can more effectively identify the original copyright of fine-tuned models compared to baseline methods. Therefore, this work provides a powerful tool for tracking copyrights and detecting unlicensed usage of LVLMs.

Yubo Wang, Jianting Tang, Chaohu Liu, Linli Xu• 2025

Related benchmarks

TaskDatasetResultRank
Fingerprint MatchingQuantization Fingerprint Queries
Performance (4-bit)76
16
Fingerprint MatchingV7W
FMR0.2
16
Fingerprint MatchingPaintingForm
FMR13
16
Fingerprint MatchingMathV
FMR14
16
Fingerprint MatchingTextVQA
FMR21
16
Copyright trackingV7W
ASR51
13
Copyright trackingTextVQA
ASR45
13
Copyright trackingST-VQA
ASR53
13
Copyright trackingPaintingF
ASR38
8
Copyright trackingMathV
ASR45
8
Showing 10 of 26 rows

Other info

Follow for update