Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ALDEN: Boosting Private Data Extraction from Retrieval-Augmented Generation Systems via Active Learning and Distribution Estimation

About

Retrieval-Augmented Generation (RAG) is widely used to augment large language models with external knowledge retrieval to improve reliability and generalization. However, recent studies have shown that RAG systems remain vulnerable to data extraction attacks, where adversaries can extract private data by embedding malicious commands into user queries. Despite their feasibility, existing attacks typically suffer from low data extraction rates and limited practical effectiveness. Here, we propose ALDEN, a novel attack that effectively and efficiently extracts private data from RAGs. First, we employ active learning to diversify malicious queries and improve data extraction rates. Second, we observe that the data distribution of the underlying knowledge base provides valuable guidance for query generation and introduce a decay-based dynamic algorithm to estimate the corresponding topic distribution. By combining them together, we demonstrate that ALDEN substantially outperforms state-of-the-art methods through comprehensive evaluations.

Xingyu Lyu, Jianfeng He, Ning Wang, Yidan Hu, Tao Li, Danjue Chen, Shixiong Li, Yimin Chen• 2026

Related benchmarks

TaskDatasetResultRank
Targeted AttackHealthcareMagic-101
LC1.28e+3
18
Targeted AttackENRON EMAIL
LC1.49e+3
18
Targeted AttackSynthetic Finance
LC1.08e+3
18
Untargeted data extraction attackHealthcareMagic-101
LC1.71e+3
18
Showing 4 of 4 rows

Other info

Follow for update