Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

About

In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integrate a large amount of new experiences after pre-training. In this work, we introduce HippoRAG, a novel retrieval framework inspired by the hippocampal indexing theory of human long-term memory to enable deeper and more efficient knowledge integration over new experiences. HippoRAG synergistically orchestrates LLMs, knowledge graphs, and the Personalized PageRank algorithm to mimic the different roles of neocortex and hippocampus in human memory. We compare HippoRAG with existing RAG methods on multi-hop question answering and show that our method outperforms the state-of-the-art methods remarkably, by up to 20%. Single-step retrieval with HippoRAG achieves comparable or better performance than iterative retrieval like IRCoT while being 10-30 times cheaper and 6-13 times faster, and integrating HippoRAG into IRCoT brings further substantial gains. Finally, we show that our method can tackle new types of scenarios that are out of reach of existing methods. Code and data are available at https://github.com/OSU-NLP-Group/HippoRAG.

Bernal Jim\'enez Guti\'errez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su• 2024

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM59.1
387
Multi-hop Question AnsweringHotpotQA
F1 Score70.5
294
Multi-hop Question AnsweringHotpotQA (test)
F164.7
255
Multi-hop Question Answering2WikiMultiHopQA (test)
EM57.8
195
Multi-hop Question AnsweringMuSiQue
EM11.7
185
Multi-hop Question Answering2WikiMQA
F1 Score62.38
161
Question Answering2Wiki
F169.7
152
Multi-hop Question Answering2Wiki
Exact Match59.4
152
Question AnsweringHotpotQA
F171.1
128
Multi-hop Question AnsweringMuSiQue (test)
F135.9
111
Showing 10 of 143 rows
...

Other info

Follow for update