DESSERT: An Efficient Algorithm for Vector Set Search with Vector Set Queries

About

We study the problem of $\textit{vector set search}$ with $\textit{vector set queries}$. This task is analogous to traditional near-neighbor search, except that both the query and each element in the collection are $\textit{sets}$ of vectors. We identify this problem as a core subroutine for semantic search applications and find that existing solutions are unacceptably slow. To this end, we present a new approximate search algorithm, DESSERT (${\bf D}$ESSERT ${\bf E}$fficiently ${\bf S}$earches ${\bf S}$ets of ${\bf E}$mbeddings via ${\bf R}$etrieval ${\bf T}$ables). DESSERT is a general tool with strong theoretical guarantees and excellent empirical performance. When we integrate DESSERT into ColBERT, a state-of-the-art semantic search model, we find a 2-5x speedup on the MS MARCO and LoTTE retrieval benchmarks with minimal loss in recall, underscoring the effectiveness and practical applicability of our proposal.

Joshua Engels, Benjamin Coleman, Vihan Lakshman, Anshumali Shrivastava • 2022
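To make the task in the abstract concrete, here is a minimal sketch of exhaustive vector set search. It is not DESSERT; it is the kind of slow exact baseline that an approximate method aims to speed up. The abstract does not specify a scoring function, so the sketch assumes ColBERT-style scoring: for each query vector, take the maximum inner product over the target set, then sum over the query vectors. The names `dot`, `sum_max_sim`, and `brute_force_search` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]
VectorSet = Sequence[Vector]


def dot(u: Vector, v: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def sum_max_sim(query: VectorSet, target: VectorSet) -> float:
    """Assumed set-to-set score: sum over query vectors of the best
    inner product against any vector in the target set."""
    return sum(max(dot(q, t) for t in target) for q in query)


def brute_force_search(query: VectorSet,
                       collection: Sequence[VectorSet],
                       k: int = 1) -> List[int]:
    """Score every set in the collection and return the indices of the
    top-k sets (ties broken by lower index)."""
    scored = [(sum_max_sim(query, target), idx)
              for idx, target in enumerate(collection)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


# Toy data: a query set of two vectors and a collection of three sets.
query = [[1.0, 0.0], [0.0, 1.0]]
collection = [
    [[1.0, 0.0], [1.0, 0.0]],  # matches only the first query vector
    [[1.0, 0.0], [0.0, 1.0]],  # matches both query vectors
    [[0.5, 0.5]],              # partial match to both
]
top = brute_force_search(query, collection, k=2)
```

Under this assumed scoring, every query compares each of its vectors against every vector of every set in the collection, so the cost grows with collection size times both set sizes times the embedding dimension. That exhaustive cost is what makes an approximate algorithm attractive for this problem.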

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Information Retrieval | ArguAna | QPS: 576 | 9 |
| Information Retrieval | Quora | QPS: 284 | 9 |
| Information Retrieval | NQ | QPS: 38 | 8 |
| Passage retrieval | MS-MARCO (test) | Latency (ms): 9.5 | 6 |
| Multi-Vector Retrieval | SCIDOCS | QPS: 285 | 5 |
