REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

About

We present REARANK, a large language model (LLM)-based listwise reasoning reranking agent. REARANK explicitly reasons before reranking, significantly improving both performance and interpretability. Leveraging reinforcement learning and data augmentation, REARANK achieves substantial improvements over baseline models across popular information retrieval benchmarks, notably requiring only 179 annotated samples. Built on top of Qwen2.5-7B, our REARANK-7B demonstrates performance comparable to GPT-4 on both in-domain and out-of-domain benchmarks and even surpasses GPT-4 on reasoning-intensive BRIGHT benchmarks. These results underscore the effectiveness of our approach and highlight how reinforcement learning can enhance LLM reasoning capabilities in reranking.

Le Zhang, Bo Wang, Xipeng Qiu, Siva Reddy, Aishwarya Agrawal• 2025

Related benchmarks

Task	Dataset	Result
Question Answering	2Wiki	EM18.2	241
Multi-hop Question Answering	2Wiki	Exact Match16.6	215
Question Answering	HotpotQA	F143.2	132
Question Answering	MuSiQue	EM5.6	84
Multi-hop Question Answering	HotpotQA	F142	79
Passage Reranking	BRIGHT	NDCG@10 (Avg)31.75	54
Information Retrieval	BRIGHT 1.0 (test)	nDCG@10 (Avg)24.6	35
Long-context Memory Retrieval and Reasoning	PersonaMem 128K	F1 Score22.95	20
Long-context Memory Retrieval and Reasoning	WebDancer 128K	F1 Score37.23	20
Long-context Memory Retrieval and Reasoning	ZH4O 128K	F1 Score49.02	20

Showing 10 of 32 rows

Other info

Follow for update

@wizwand_team Discord