Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image

About

Retrieval-augmented generation (RAG) is instrumental for inhibiting hallucinations in large language models (LLMs) through the use of a factual knowledge base (KB). Although PDF documents are prominent sources of knowledge, text-based RAG pipelines are ineffective at capturing their rich multi-modal information. In contrast, visual document RAG (VD-RAG) uses screenshots of document pages as the KB, which has been shown to achieve state-of-the-art results. However, by introducing the image modality, VD-RAG introduces new attack vectors for adversaries to disrupt the system by injecting malicious documents into the KB. In this paper, we demonstrate the vulnerability of VD-RAG to poisoning attacks targeting both retrieval and generation. We define two attack objectives and demonstrate that both can be realized by injecting only a single adversarial image into the KB. Firstly, we introduce a targeted attack against one or a group of queries with the goal of spreading targeted disinformation. Secondly, we present a universal attack that, for any potential user query, influences the response to cause a denial-of-service in the VD-RAG system. We investigate the two attack objectives under both white-box and black-box assumptions, employing a multi-objective gradient-based optimization approach as well as prompting state-of-the-art generative models. Using two visual document datasets, a diverse set of state-of-the-art retrievers (embedding models) and generators (vision language models), we show VD-RAG is vulnerable to poisoning attacks in both the targeted and universal settings, yet demonstrating robustness to black-box attacks in the universal setting.

Ezzeldin Shereen, Dan Ristea, Shae McFadden, Burak Hasircioglu, Vasilios Mavroudis, Chris Hicks• 2025

Related benchmarks

TaskDatasetResultRank
Document RetrievalViDoRe AI v1 (test)
Delta Recall@10.19
23
GenerationViDoRe AI v1 (test)
ASR-G Similarity (mean) @-11
23
Targeted AttackViDoRe AI V1
ASR-R@1 (mean)100
18
GenerationViDoRe AI (Targeted Setting III) V1
ASR-G Sim^-1 (Mean)93
10
RetrievalViDoRe AI (Targeted Setting III) V1
ASR-R@1 (Mean)100
10
Targeted Poisoning AttackViDoRe AI (Targeted Setting II: 5 queries, 1 answer) V1
ASR @1 (mean)88
10
Document RetrievalViDoRe ESG V2
Recall@148
9
Targeted Poisoning AttackViDoRe ESG v2 (test)
ASR @1100
9
Text GenerationViDoRe ESG V2
ASR-G Sim (mean) @-1100
9
Showing 9 of 9 rows

Other info

Follow for update