Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FAVOR: Efficient Filter-Agnostic Vector ANNS Based on Selectivity-Aware Exclusion Distances

About

Modern retrieval systems increasingly require integrating approximate nearest neighbor search (ANNS) with complex attribute filtering to handle hybrid queries in applications such as recommendation systems and retrieval-augmented generation (RAG). While HNSW-based inline-filtering methods show promise, existing approaches struggle to deliver high throughput under low-selectivity scenarios while balancing search efficiency, filtering generality, and index connectivity. To address these challenges, we propose FAVOR, an efficient filter-agnostic vector ANNS that supports arbitrary filtering conditions while maintaining stable performance across varying selectivity levels. FAVOR introduces three novel features: (1) an integrated architecture that unifies selectivity estimation and filtered ANNS execution, providing a cohesive solution for hybrid vector-attribute queries; (2) a HNSW-based inline-filtering algorithm that introduces an exclusion distance mechanism to dynamically reshape the vector distance distribution, pushing non-target vectors away from the query while promoting valid candidates toward the query, thus improving search efficiency without compromising generality or graph connectivity; and (3) a selectivity-driven search selector that estimates query selectivity and dynamically routes queries between a pre-filtering brute-force algorithm for low-selectivity cases and an optimized HNSW-based search algorithm for other scenarios, ensuring consistent performance. Extensive experiments on real-world datasets demonstrate that FAVOR achieves a 1.3-5$\times$ higher QPS at $Recall@10 = 95\%$ compared to state-of-the-art methods for arbitrary filtering conditions, while maintaining competitive performance even against tailored solutions in some filtering conditions.

Junjie Song, Yu Liu, Guoyu Hu, Zhongle Xie, Ming Yang, Beng Chin Ooi, Ke Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Approximate Nearest Neighbor SearchSIFT1M--
24
Index ConstructionSIFT-1M
Construction Time (s)11.2
8
Approximate Nearest Neighbor SearchDEEP1M
Storage Overhead (GB)0.63
6
Approximate Nearest Neighbor SearchPaper
Storage overhead (GB)2.04
6
Index ConstructionGIST1M
Construction Time (s)64.7
6
Index ConstructionDEEP1M
Construction Time (s)9.7
6
Index ConstructionWords
Construction Time (s)1.3
6
Index ConstructionLAION25M
Construction Time (s)1.15e+3
6
Approximate Nearest Neighbor SearchGIST1M
Storage Overhead (GB)4.08
6
Approximate Nearest Neighbor SearchMsong
Storage Overhead (GB)2.05
6
Showing 10 of 14 rows

Other info

Follow for update