Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Accelerating Large-Scale Inference with Anisotropic Vector Quantization

About

Quantization based techniques are the current state-of-the-art for scaling maximum inner product search to massive databases. Traditional approaches to quantization aim to minimize the reconstruction error of the database points. Based on the observation that for a given query, the database points that have the largest inner products are more relevant, we develop a family of anisotropic quantization loss functions. Under natural statistical assumptions, we show that quantization with these loss functions leads to a new variant of vector quantization that more greatly penalizes the parallel component of a datapoint's residual relative to its orthogonal component. The proposed approach achieves state-of-the-art results on the public benchmarks available at \url{ann-benchmarks.com}.

Ruiqi Guo, Philip Sun, Erik Lindgren, Quan Geng, David Simcha, Felix Chern, Sanjiv Kumar• 2019

Related benchmarks

TaskDatasetResultRank
Approximate Nearest Neighbor SearchANNS Empirical Evaluation Suite
QPS vs Recall Tier-1
9
kNNGlove 100
Throughput (QPS)3.39e+5
7
Approximate Nearest Neighbor SearchFashion MNIST
Throughput (QPS)2.08e+6
4
Approximate Nearest Neighbor SearchDEEP 10M
Throughput (QPS)1.47e+5
4
Approximate Nearest Neighbor SearchGIST
Throughput (QPS)3.80e+4
4
Showing 5 of 5 rows

Other info

Follow for update