The Faiss library
About
Vector databases typically manage large collections of embedding vectors. Currently, AI applications are growing rapidly, and so is the number of embeddings that need to be stored and indexed. The Faiss library is dedicated to vector similarity search, a core functionality of vector databases. Faiss is a toolkit of indexing methods and related primitives used to search, cluster, compress and transform vectors. This paper describes the trade-off space of vector search and the design principles of Faiss in terms of structure, approach to optimization and interfacing. We benchmark key features of the library and discuss a few selected applications to highlight its broad applicability.
Matthijs Douze, Alexandr Guzhva, Chengqi Deng, Jeff Johnson, Gergely Szilvasy, Pierre-Emmanuel Mazar\'e, Maria Lomeli, Lucas Hosseini, Herv\'e J\'egou• 2024
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Insight Generation | Internal document collections non-scientific domain Legal Business Analysis 1.0 (test) | Insight Score4.6 | 20 | |
| Insight Generation | Internal Document Collections Climate Change Policy | Insight-level Score4.28 | 20 | |
| Insight Generation | Internal document collections non-scientific domain Revenue & Finance Reports | Insight-level Score4.4 | 20 | |
| Insight Generation | SCOpE-QA (Set-level) | Inference Optimization Score3.98 | 20 | |
| Insight-level Evaluation | SCOpE-QA Inference Optimization collection | Insight-level Score4.09 | 20 | |
| Insight-level Evaluation | SCOpE-QA LLM as Agents collection | Insight Score4.05 | 20 | |
| Insight-level Evaluation | SCOpE-QA Preference Optimization collection | Insight-level Score4.13 | 20 | |
| Insight-level Evaluation | SCOpE-QA Long-context RAG collection | Insight Score4.24 | 20 | |
| Insight-level Evaluation | SCOpE-QA Representation Learning collection | Insight Score4.12 | 20 | |
| Insight-level Evaluation | SCOpE-QA Hate Speech Detection | Insight-level Score4.29 | 20 |
Showing 10 of 49 rows