Less Is More: Elevating RAG via Performance-Driven Context Compression

About

Retrieval-Augmented Generation (RAG) has emerged as a promising paradigm for improving the timeliness of knowledge updates and the factual accuracy of large language models. However, incorporating a large volume of retrieved documents significantly increases input length, leading to prohibitive computational costs. Existing compression approaches often compromise task performance, primarily because they rely on predefined heuristics that do not ensure the compressed context is conducive to the generation task. To address these limitations, we propose CORE-RAG, a novel framework for context compression in RAG systems. CORE eliminates reliance on proxy heuristics through a performance-driven learning framework, which directly uses task performance as a feedback signal to iteratively refine the compressor policy. Prior to this optimization process, we incorporate a knowledge distillation phase to initialize the compressor with a robust policy. Extensive experiments demonstrate the superiority of our approach. At a high compression ratio of 3%, CORE not only avoids performance degradation but also improves the average Exact Match (EM) score by 3.3 points compared to using full documents. Our code is available at https://github.com/ziqiangcui/CORE-RAG-ICML26.
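The abstract relies on two quantities, the compression ratio and the Exact Match (EM) score. The sketch below shows one common way to compute them. It is not taken from the CORE-RAG repository: the function names are hypothetical, the whitespace token count stands in for whatever tokenizer the authors use, and the SQuAD-style answer normalization is an assumption.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: SQuAD-style normalization; the paper's exact rules may differ.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: list[str]) -> float:
    """Return 1.0 if the normalized prediction equals any normalized gold answer."""
    pred = normalize_answer(prediction)
    return float(any(pred == normalize_answer(gold) for gold in gold_answers))


def compression_ratio(compressed: str, documents: list[str]) -> float:
    """Compressed length divided by total original length.

    Assumption: lengths are counted in whitespace-separated tokens.
    """
    original_len = sum(len(doc.split()) for doc in documents)
    if original_len == 0:
        raise ValueError("documents contain no tokens")
    return len(compressed.split()) / original_len
```

Under these assumptions, a 3-token summary of documents totalling 100 tokens has a compression ratio of 0.03, which corresponds to the 3% setting mentioned above. A per-example EM score like this is also the kind of task-performance signal a performance-driven compressor could receive as feedback.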

Ziqiang Cui, Yunpeng Weng, Xing Tang, Peiyang Liu, Shiwei Li, Bowei He, Jiamin Chen, Yansen Zhang, Xiuqiang He, Chen Ma • 2025

Related benchmarks

Task	Dataset	Result
Question Answering	2WikiMultiHopQA (test)	F1 41.62	113
Open-domain Question Answering	HotpotQA	Accuracy 36.93	103
Open-domain Question Answering	TriviaQA	EM 67.36	88
Open-domain Question Answering	NQ	--	74
Open-domain Question Answering	NQ (test)	EM 40.33	39
Question Answering	2WikiMultihopQA LongBench	F1 Score 59.73	32
Open-domain Question Answering	HotpotQA (test)	Accuracy (Exact Match) 34.12	27
Open-domain Question Answering	2WikiMultihopQA	EM 40.71	16
Question Answering	NQ	Exact Match (EM) 41.88	15
Question Answering	LongBench HotpotQA	F1 Score 63.58	12

Showing 10 of 14 rows
