Complementing Lexical Retrieval with Semantic Residual Embedding

About

This paper presents CLEAR, a retrieval model that seeks to complement classical lexical exact-match models such as BM25 with semantic matching signals from a neural embedding matching model. CLEAR explicitly trains the neural embedding to encode language structures and semantics that lexical retrieval fails to capture with a novel residual-based embedding learning method. Empirical evaluations demonstrate the advantages of CLEAR over state-of-the-art retrieval models, and that it can substantially improve the end-to-end accuracy and efficiency of reranking pipelines.
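One way to read "residual-based" is that the embedding model is penalized mainly on query-document pairs that the lexical model gets wrong. The following is a minimal sketch of that idea, not the authors' implementation: the function names, the hinge-loss form, and the `base_margin` and `lam` values are all illustrative assumptions.

```python
# Hypothetical sketch: a hinge loss whose margin depends on the lexical model's error.
# All names and constants here are assumptions made for illustration.

def residual_margin(lex_pos, lex_neg, base_margin=1.0, lam=0.1):
    """Margin shrinks when the lexical model (e.g. BM25) already scores the
    relevant document above the non-relevant one, and grows when it does not."""
    return base_margin - lam * (lex_pos - lex_neg)


def residual_hinge_loss(emb_pos, emb_neg, lex_pos, lex_neg,
                        base_margin=1.0, lam=0.1):
    """Pairwise hinge loss on embedding scores using the residual margin."""
    margin = residual_margin(lex_pos, lex_neg, base_margin, lam)
    return max(0.0, margin - emb_pos + emb_neg)


def combined_score(lex_score, emb_score, lam=0.1):
    """Final ranking score: weighted lexical score plus embedding similarity."""
    return lam * lex_score + emb_score


# Lexical model already separates the pair: the embedding is not penalized.
easy = residual_hinge_loss(emb_pos=0.2, emb_neg=0.3, lex_pos=20.0, lex_neg=5.0)

# Lexical model ranks the pair the wrong way round: the embedding must compensate.
hard = residual_hinge_loss(emb_pos=0.2, emb_neg=0.3, lex_pos=5.0, lex_neg=20.0)
```

Under these assumptions, `easy` is zero while `hard` is large, so gradient updates concentrate on the cases where exact-match retrieval fails.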

Luyu Gao, Zhuyun Dai, Tongfei Chen, Zhen Fan, Benjamin Van Durme, Jamie Callan • 2020

Related benchmarks

Task	Dataset	Result	Rank
Retrieval	MS MARCO (dev)	MRR@10 0.299	84
Retrieval	TREC DL 2019	NDCG@10 66.4	83
