Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

About

In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integrate a large amount of new experiences after pre-training. In this work, we introduce HippoRAG, a novel retrieval framework inspired by the hippocampal indexing theory of human long-term memory to enable deeper and more efficient knowledge integration over new experiences. HippoRAG synergistically orchestrates LLMs, knowledge graphs, and the Personalized PageRank algorithm to mimic the different roles of neocortex and hippocampus in human memory. We compare HippoRAG with existing RAG methods on multi-hop question answering and show that our method outperforms the state-of-the-art methods remarkably, by up to 20%. Single-step retrieval with HippoRAG achieves comparable or better performance than iterative retrieval like IRCoT while being 10-30 times cheaper and 6-13 times faster, and integrating HippoRAG into IRCoT brings further substantial gains. Finally, we show that our method can tackle new types of scenarios that are out of reach of existing methods. Code and data are available at https://github.com/OSU-NLP-Group/HippoRAG.

Bernal Jim\'enez Guti\'errez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su• 2024

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM30
278
Multi-hop Question AnsweringHotpotQA
F1 Score63.91
221
Multi-hop Question AnsweringHotpotQA (test)
F164.7
198
Multi-hop Question Answering2WikiMQA
F1 Score62.38
154
Multi-hop Question Answering2WikiMultiHopQA (test)
EM57.8
143
Question AnsweringHotpotQA
F171.1
114
Multi-hop Question AnsweringMuSiQue (test)
F135.9
111
Question AnsweringMuSiQue
EM24
84
Question AnsweringPopQA
EM42.5
80
Multi-hop Question AnsweringHotpotQA
F155.87
79
Showing 10 of 98 rows
...

Other info

Follow for update