Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement Learning

About

In this paper, we introduce Rank-R1, a novel LLM-based reranker that performs reasoning over both the user query and candidate documents before performing the ranking task. Existing document reranking methods based on large language models (LLMs) typically rely on prompting or fine-tuning LLMs to order or label candidate documents according to their relevance to a query. For Rank-R1, we use a reinforcement learning algorithm along with only a small set of relevance labels (without any reasoning supervision) to enhance the reasoning ability of LLM-based rerankers. Our hypothesis is that adding reasoning capabilities to the rerankers can improve their relevance assessement and ranking capabilities. Our experiments on the TREC DL and BRIGHT datasets show that Rank-R1 is highly effective, especially for complex queries. In particular, we find that Rank-R1 achieves effectiveness on in-domain datasets at par with that of supervised fine-tuning methods, but utilizing only 18\% of the training data used by the fine-tuning methods. We also find that the model largely outperforms zero-shot and supervised fine-tuning when applied to out-of-domain datasets featuring complex queries, especially when a 14B-size model is used. Finally, we qualitatively observe that Rank-R1's reasoning process improves the explainability of the ranking results, opening new opportunities for search engine results presentation and fruition.

Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon• 2025

Related benchmarks

Task	Dataset	Result
Question Answering	2Wiki	EM18	260
Multi-hop Question Answering	2Wiki	Exact Match16.4	215
Information Retrieval	BEIR	SciFact0.722	174
Document Ranking	TREC DL Track 2019 (test)	nDCG@1072.7	133
Question Answering	HotpotQA	F142.5	132
Information Retrieval	BRIGHT	Mean nDCG@1020.5	94
Question Answering	MuSiQue	EM5.2	84
Multi-hop Question Answering	HotpotQA	F137.1	79
Reranking	TREC 2020 (test)	NDCG@1069.1	55
Passage Reranking	BRIGHT	NDCG@10 (Avg)30.8	54

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord