HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

About

In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integrate a large amount of new experiences after pre-training. In this work, we introduce HippoRAG, a novel retrieval framework inspired by the hippocampal indexing theory of human long-term memory to enable deeper and more efficient knowledge integration over new experiences. HippoRAG synergistically orchestrates LLMs, knowledge graphs, and the Personalized PageRank algorithm to mimic the different roles of neocortex and hippocampus in human memory. We compare HippoRAG with existing RAG methods on multi-hop question answering and show that our method outperforms the state-of-the-art methods remarkably, by up to 20%. Single-step retrieval with HippoRAG achieves comparable or better performance than iterative retrieval like IRCoT while being 10-30 times cheaper and 6-13 times faster, and integrating HippoRAG into IRCoT brings further substantial gains. Finally, we show that our method can tackle new types of scenarios that are out of reach of existing methods. Code and data are available at https://github.com/OSU-NLP-Group/HippoRAG.

Bernal Jim\'enez Guti\'errez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su• 2024

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	2WikiMultihopQA	EM59.1	559
Multi-hop Question Answering	HotpotQA (test)	F164.7	311
Multi-hop Question Answering	HotpotQA	F1 Score70.5	294
Question Answering	2Wiki	EM59.4	241
Multi-hop Question Answering	2WikiMultiHopQA (test)	EM57.8	226
Multi-hop Question Answering	2Wiki	Exact Match59.4	215
Multi-hop Question Answering	MuSiQue	EM11.7	209
Long-context Question Answering	Locomo	F1 (Multi Hop)33.59	171
Multi-hop Question Answering	2WikiMQA	F1 Score62.38	161
Multi-hop QA	HotpotQA	Exact Match52.6	143

Showing 10 of 233 rows

...

Other info

Follow for update

@wizwand_team Discord