HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

About

In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integrate a large amount of new experiences after pre-training. In this work, we introduce HippoRAG, a novel retrieval framework inspired by the hippocampal indexing theory of human long-term memory to enable deeper and more efficient knowledge integration over new experiences. HippoRAG synergistically orchestrates LLMs, knowledge graphs, and the Personalized PageRank algorithm to mimic the different roles of neocortex and hippocampus in human memory. We compare HippoRAG with existing RAG methods on multi-hop question answering and show that our method outperforms the state-of-the-art methods remarkably, by up to 20%. Single-step retrieval with HippoRAG achieves comparable or better performance than iterative retrieval like IRCoT while being 10-30 times cheaper and 6-13 times faster, and integrating HippoRAG into IRCoT brings further substantial gains. Finally, we show that our method can tackle new types of scenarios that are out of reach of existing methods. Code and data are available at https://github.com/OSU-NLP-Group/HippoRAG.

Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su • 2024
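
The abstract names Personalized PageRank over a knowledge graph as the retrieval mechanism. The sketch below illustrates that general idea only and is not the authors' implementation: the toy graph, the entity and passage names, the damping value, and the helper names `personalized_pagerank` and `rank_passages` are all assumptions made for illustration. Query entities act as seed nodes, probability mass spreads along graph edges, and each passage is scored by the mass that lands on the entities it mentions.

```python
from collections import defaultdict


def personalized_pagerank(edges, seeds, damping=0.5, iterations=50):
    """Power-iteration Personalized PageRank on an undirected graph.

    edges: iterable of (u, v) node pairs.
    seeds: dict mapping seed nodes (assumed to be in the graph) to weights.
    damping: probability of following an edge rather than restarting
             (illustrative value, not taken from the paper's text above).
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    nodes = sorted(neighbors)

    # The restart distribution is concentrated on the query (seed) nodes.
    total = sum(seeds.values())
    restart = {n: seeds.get(n, 0.0) / total for n in nodes}

    scores = dict(restart)
    for _ in range(iterations):
        nxt = {n: (1.0 - damping) * restart[n] for n in nodes}
        for n in nodes:
            # Each node splits its damped score evenly among its neighbors.
            share = damping * scores[n] / len(neighbors[n])
            for m in neighbors[n]:
                nxt[m] += share
        scores = nxt
    return scores


def rank_passages(passage_entities, node_scores):
    """Score each passage by the summed PPR mass of the entities it mentions."""
    totals = {
        pid: sum(node_scores.get(e, 0.0) for e in ents)
        for pid, ents in passage_entities.items()
    }
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: a chain of entities linked through three passages.
edges = [
    ("Robotics", "Alice"), ("Alice", "Stanford"),
    ("Stanford", "Thomas"), ("Thomas", "Alzheimer's"),
    ("Alzheimer's", "Bob"), ("Bob", "MIT"),
]
passages = {
    "p_alice": ["Robotics", "Alice", "Stanford"],
    "p_thomas": ["Stanford", "Thomas", "Alzheimer's"],
    "p_bob": ["Alzheimer's", "Bob", "MIT"],
}

# A two-hop style query that mentions both "Stanford" and "Alzheimer's".
scores = personalized_pagerank(edges, {"Stanford": 1.0, "Alzheimer's": 1.0})
ranked = rank_passages(passages, scores)
print(ranked[0][0])
```

In this toy setup the passage that connects both query entities ranks first after a single retrieval step, which is the kind of multi-hop integration the abstract describes.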

Related benchmarks

Task                               Dataset                   Result                  Rank
Multi-hop Question Answering       2WikiMultihopQA           EM 59.1                 559
Multi-hop Question Answering       HotpotQA (test)           F1 64.7                 311
Multi-hop Question Answering       HotpotQA                  F1 Score 70.5           294
Question Answering                 2Wiki                     EM 59.4                 241
Multi-hop Question Answering       2WikiMultiHopQA (test)    EM 57.8                 226
Multi-hop Question Answering       2Wiki                     Exact Match 59.4        215
Multi-hop Question Answering       MuSiQue                   EM 11.7                 209
Long-context Question Answering    Locomo                    F1 (Multi Hop) 33.59    171
Multi-hop Question Answering       2WikiMQA                  F1 Score 62.38          161
Multi-hop QA                       HotpotQA                  Exact Match 52.6        143
Showing 10 of 233 rows
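
The results above are reported as exact match (EM) and F1. This page does not say how each benchmark computes them, so the sketch below assumes the common SQuAD-style convention: lowercase the answer, strip punctuation and English articles, then compare whole strings for EM and token overlap for F1. The function names are illustrative.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, gold):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def token_f1(prediction, gold):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


# A verbose prediction keeps full recall but loses precision.
em = exact_match("The Eiffel Tower.", "eiffel tower")
f1 = token_f1("Thomas Sudhof of Stanford", "Thomas Sudhof")
print(em, round(f1, 3))
```

Under this convention EM is all-or-nothing while F1 gives partial credit, which is why the same system can show noticeably different EM and F1 values on one dataset.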