Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Optimizing Test-Time Query Representations for Dense Retrieval

About

Recent developments of dense retrieval rely on quality representations of queries and contexts from pre-trained query and context encoders. In this paper, we introduce TOUR (Test-Time Optimization of Query Representations), which further optimizes instance-level query representations guided by signals from test-time retrieval results. We leverage a cross-encoder re-ranker to provide fine-grained pseudo labels over retrieval results and iteratively optimize query representations with gradient descent. Our theoretical analysis reveals that TOUR can be viewed as a generalization of the classical Rocchio algorithm for pseudo relevance feedback, and we present two variants that leverage pseudo-labels as hard binary or soft continuous labels. We first apply TOUR on phrase retrieval with our proposed phrase re-ranker, and also evaluate its effectiveness on passage retrieval with an off-the-shelf re-ranker. TOUR greatly improves end-to-end open-domain question answering accuracy, as well as passage retrieval performance. TOUR also consistently improves direct re-ranking by up to 2.0% while running 1.3-2.4x faster with an efficient implementation.

Mujeen Sung, Jungsoo Park, Jaewoo Kang, Danqi Chen, Jinhyuk Lee• 2022

Related benchmarks

TaskDatasetResultRank
Open Question AnsweringNatural Questions (NQ) (test)
Exact Match (EM)48.6
134
Open-domain Question AnsweringTriviaQA (test)
Exact Match66.8
80
Passage retrievalTriviaQA (test)
Top-100 Acc86.1
67
Open-domain Question AnsweringWebQuestions (WebQ) (test)
Exact Match (EM)46.9
55
Open-domain Question AnsweringCuratedTREC (test)
Exact Match (EM)39.8
26
End-to-end Open-Domain Question AnsweringTREC (test)
Exact Match (EM)63.1
21
Passage retrievalNQ multi-dataset training (test)
Accuracy@2084.2
8
Passage retrievalEntityQuestions unseen query distribution (test)
Accuracy@200.662
8
Open-domain Question AnsweringEntityQuestions (ENTITYQ) (test)
EM28.3
7
Showing 9 of 9 rows

Other info

Code

Follow for update