Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Less Is More: Elevating RAG via Performance-Driven Context Compression

About

Retrieval-Augmented Generation (RAG) has emerged as a promising paradigm for improving the timeliness of knowledge updates and the factual accuracy of large language models. However, incorporating a large volume of retrieved documents significantly increases input length, leading to prohibitive computational costs. Existing compression approaches often compromise task performance, primarily due to their reliance on predefined heuristics. These heuristics fail to ensure that the compressed context is conducive to the generation tasks. To address these limitations, we propose CORE-RAG, a novel framework for context compression in RAG systems. CORE eliminates reliance on proxy heuristics through a performance-driven learning framework, which directy utilizes task performance as a feedback signal to iteratively refine the compressor policy. Prior to this optimization process, we incorporate a knowledge distillation phase to initialize the compressor with a robust policy. Extensive experiments demonstrate the superiority of our approach. At a high compression ratio of 3%, CORE not only avoids performance degradation but also improves the average Exact Match (EM) score by 3.3 points compared to using full documents. Our code is available at https://github.com/ziqiangcui/CORE-RAG-ICML26.

Ziqiang Cui, Yunpeng Weng, Xing Tang, Peiyang Liu, Shiwei Li, Bowei He, Jiamin Chen, Yansen Zhang, Xiuqiang He, Chen Ma• 2025

Related benchmarks

TaskDatasetResultRank
Question Answering2WikiMultiHopQA (test)
F141.62
113
Open-domain Question AnsweringTriviaQA
EM67.36
88
Open-domain Question AnsweringNQ--
74
Open-domain Question AnsweringHotpotQA
Accuracy36.93
73
Open-domain Question AnsweringNQ (test)
EM40.33
39
Question Answering2WikiMultihopQA LongBench
F1 Score59.73
32
Open-domain Question AnsweringHotpotQA (test)
Accuracy (Exact Match)34.12
17
Open-domain Question Answering2WikiMultihopQA
EM40.71
16
Question AnsweringNQ
Exact Match (EM)41.88
15
Question AnsweringLongBench HotpotQA
F1 Score63.58
12
Showing 10 of 14 rows

Other info

Follow for update