GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs
About
Large Language Models (LLMs) have emerged as powerful tools for passage reranking in information retrieval, leveraging their superior reasoning capabilities to address the limitations of conventional models on complex queries. However, current LLM-based reranking paradigms are fundamentally constrained by an efficiency-accuracy trade-off: (1) pointwise methods are efficient but ignore inter-document comparison, yielding suboptimal accuracy; (2) listwise methods capture global context but suffer from context-window constraints and prohibitive inference latency. To address these issues, we propose GroupRank, a novel paradigm that balances flexibility and context awareness. To unlock the full potential of groupwise reranking, we propose an answer-free data synthesis pipeline that fuses local pointwise signals with global listwise rankings. These samples facilitate supervised fine-tuning and reinforcement learning, with the latter guided by a specialized group-ranking reward comprising ranking-utility and group-alignment. These complementary components synergistically optimize document ordering and score calibration to reflect intrinsic query-document relevance. Experimental results show GroupRank achieves a state-of-the-art 65.2 NDCG@10 on BRIGHT and surpasses baselines by 2.1 points on R2MED, while delivering a 6.4$\times$ inference speedup.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Retrieval | HotpotQA | R@590.6 | 36 | |
| Document Reranking | BEIR | Average NDCG@1055.1 | 12 | |
| Passage Reranking | BRIGHT | NDCG@10 (Avg)38 | 12 | |
| Reranking | R2MED (test) | Average Score52.3 | 12 | |
| Retrieval | MuSiQue | Recall@565.08 | 10 | |
| Retrieval | DetectiveQA | Recall@329.34 | 8 | |
| Retrieval | NarrativeQA | Recall@323.98 | 8 | |
| Retrieval | Overall (Musique, HotpotQA, NarrativeQA, DetectiveQA) | Avg Recall@347.82 | 8 | |
| Retrieval and Reranking | LoCoMo (test) | Recall@377.99 | 5 |