Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond Yes and No: Improving Zero-Shot LLM Rankers via Scoring Fine-Grained Relevance Labels

About

Zero-shot text rankers powered by recent LLMs achieve remarkable ranking performance by simply prompting. Existing prompts for pointwise LLM rankers mostly ask the model to choose from binary relevance labels like "Yes" and "No". However, the lack of intermediate relevance label options may cause the LLM to provide noisy or biased answers for documents that are partially relevant to the query. We propose to incorporate fine-grained relevance labels into the prompt for LLM rankers, enabling them to better differentiate among documents with different levels of relevance to the query and thus derive a more accurate ranking. We study two variants of the prompt template, coupled with different numbers of relevance levels. Our experiments on 8 BEIR data sets show that adding fine-grained relevance labels significantly improves the performance of LLM rankers.

Honglei Zhuang, Zhen Qin, Kai Hui, Junru Wu, Le Yan, Xuanhui Wang, Michael Bendersky• 2023

Related benchmarks

TaskDatasetResultRank
RankingBEIR selected subset v1.0.0 (test)
TREC-COVID79.02
38
Passage RankingNQ
MRR39.49
29
Passage RankingWebQuestions (WQ)
R@1062.05
28
Passage retrievalNatural Questions (NQ)
Top-10 Accuracy63.07
28
Passage RankingTREC DL 2019
R@1093.33
28
Passage RankingTREC DL 2020
R@1093.18
28
Pointwise RankingTREC DL 2020 (test)
nDCG@100.6059
19
RankingNTCIR
nDCG73.6
8
CalibrationNTCIR
CB-ECE1.677
8
RankingTREC
nDCG0.79
8
Showing 10 of 11 rows

Other info

Follow for update