Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement Learning

About

In this paper, we introduce Rank-R1, a novel LLM-based reranker that performs reasoning over both the user query and candidate documents before performing the ranking task. Existing document reranking methods based on large language models (LLMs) typically rely on prompting or fine-tuning LLMs to order or label candidate documents according to their relevance to a query. For Rank-R1, we use a reinforcement learning algorithm along with only a small set of relevance labels (without any reasoning supervision) to enhance the reasoning ability of LLM-based rerankers. Our hypothesis is that adding reasoning capabilities to the rerankers can improve their relevance assessement and ranking capabilities. Our experiments on the TREC DL and BRIGHT datasets show that Rank-R1 is highly effective, especially for complex queries. In particular, we find that Rank-R1 achieves effectiveness on in-domain datasets at par with that of supervised fine-tuning methods, but utilizing only 18\% of the training data used by the fine-tuning methods. We also find that the model largely outperforms zero-shot and supervised fine-tuning when applied to out-of-domain datasets featuring complex queries, especially when a 14B-size model is used. Finally, we qualitatively observe that Rank-R1's reasoning process improves the explainability of the ranking results, opening new opportunities for search engine results presentation and fruition.

Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringHotpotQA
F142.5
114
Document RankingTREC DL Track 2019 (test)
nDCG@1072.7
96
Question AnsweringMuSiQue
EM5.2
84
Multi-hop Question AnsweringHotpotQA
F137.1
79
Question Answering2Wiki
F130.3
75
RerankingTREC 2020 (test)
NDCG@1069.1
55
Multi-hop Question Answering2Wiki
F1 Score28
41
Information RetrievalBRIGHT 1.0 (test)
nDCG@10 (Avg)24.2
18
Passage RerankingBEIR (test)
Covid84.63
11
Rich-media Re-rankingRich-Media Search Broad-needs Query 1.0 (test)
N@483.9
10
Showing 10 of 13 rows

Other info

Follow for update