Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GeAR: Generation Augmented Retrieval

About

Document retrieval techniques are essential for developing large-scale information systems. The common approach involves using a bi-encoder to compute the semantic similarity between a query and documents. However, the scalar similarity often fail to reflect enough information, hindering the interpretation of retrieval results. In addition, this process primarily focuses on global semantics, overlooking the finer-grained semantic relationships between the query and the document's content. In this paper, we introduce a novel method, $\textbf{Ge}$neration $\textbf{A}$ugmented $\textbf{R}$etrieval ($\textbf{GeAR}$), which not only improves the global document-query similarity through contrastive learning, but also integrates well-designed fusion and decoding modules. This enables GeAR to generate relevant context within the documents based on a given query, facilitating learning to retrieve local fine-grained information. Furthermore, when used as a retriever, GeAR does not incur any additional computational cost over bi-encoders. GeAR exhibits competitive retrieval performance across diverse scenarios and tasks. Moreover, qualitative analysis and the results generated by GeAR provide novel insights into the interpretation of retrieval results. The code, data, and models will be released at \href{https://github.com/microsoft/LMOps}{https://github.com/microsoft/LMOps}.

Haoyu Liu, Shaohan Huang, Jianfeng Liu, Yuefeng Zhan, Hao Sun, Weiwei Deng, Feng Sun, Furu Wei, Qi Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringSQuAD
F164.5
127
Question AnsweringNQ
EM65.7
57
Question AnsweringTriviaQA
F159.1
46
Local Information RetrievalSQuAD
Recall@181.4
10
Local Information RetrievalNQ
Recall@176.1
10
Local Information RetrievalTriviaQA
Recall@10.51
10
Local Information RetrievalPAQ
Recall@10.884
10
Local Information RetrievalRIR
Recall@393.3
10
Global document retrievalSQuAD
Recall@588.9
9
Global document retrievalPAQ
Recall@595.5
9
Showing 10 of 15 rows

Other info

Follow for update