Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

About

Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document relevance rather than generation usefulness. A relevant document may still introduce noise, while a lower-ranked document may better reduce the generator's uncertainty. We propose CAR (Confidence-Aware Reranking), a query-guided, training-free, and plug-and-play reranking framework that uses generator confidence change as a document usefulness signal. CAR estimates confidence through the semantic consistency of multiple sampled answers under query-only and query-document conditions. Documents that significantly increase confidence are promoted, those that decrease confidence are demoted, and uncertain cases preserve the baseline order, while a query-level gate avoids unnecessary intervention on already confident queries. Experiments on four BEIR datasets show that CAR consistently improves NDCG@5 across sparse and dense retrievers, LLM-based and supervised rerankers, and four LLM backbones. Notably, CAR improves the YesNo reranker by 25.4 percent on average under Contriever retrieval, and its ranking gains strongly correlate with downstream generation F1 improvements, achieving Spearman rho = 0.964.

Zhipeng Song, Yizhi Zhou, Xiangyu Kong, Jiulong Jiao, Xuezhou Ye, Chunqi Gao, Xueqing Shi, Yuhang Zhou, Heng Qi• 2026

Related benchmarks

TaskDatasetResultRank
Document RerankingNQ
NDCG@550.979
14
Document RerankingFEVER
NDCG@581.556
14
Document RerankingSCID
NDCG@513.919
14
Document RerankingCOVID
NDCG@571.038
14
Document RerankingAVG
NDCG@554.373
14
Document RerankingNQ (test)
NDCG@58.339
14
End-to-end generationNQ (test)
F1 Score15.818
14
Showing 7 of 7 rows

Other info

Follow for update