Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational Paths

About

Retrieval-augmented generation (RAG) improves the response quality of large language models (LLMs) by retrieving knowledge from external databases. Typical RAG approaches split the text database into chunks, organizing them in a flat structure for efficient searches. To better capture the inherent dependencies and structured relationships across the text database, researchers propose to organize textual information into an indexing graph, known asgraph-based RAG. However, we argue that the limitation of current graph-based RAG methods lies in the redundancy of the retrieved information, rather than its insufficiency. Moreover, previous methods use a flat structure to organize retrieved information within the prompts, leading to suboptimal performance. To overcome these limitations, we propose PathRAG, which retrieves key relational paths from the indexing graph, and converts these paths into textual form for prompting LLMs. Specifically, PathRAG effectively reduces redundant information with flow-based pruning, while guiding LLMs to generate more logical and coherent responses with path-based prompting. Experimental results show that PathRAG consistently outperforms state-of-the-art baselines across six datasets and five evaluation dimensions. The code is available at the following link: https://github.com/BUPT-GAMMA/PathRAG

Boyu Chen, Zirui Guo, Zidan Yang, Yuluo Chen, Junze Chen, Zhenghao Liu, Chuan Shi, Cheng Yang• 2025

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM23.1
387
Multi-hop Question AnsweringHotpotQA
F1 Score42.9
294
Multi-hop Question AnsweringHotpotQA (test)
F155.8
255
Multi-hop Question AnsweringMuSiQue (test)
F131.1
111
Question Answering2WikiMultiHopQA (test)
F112.42
81
Multi-hop Question AnsweringMuSiQue
F122.1
38
Biomedical Multi-hop Question AnsweringCondMedQA
EM53
36
Biomedical Multi-hop Question AnsweringBioASQ-B
EM24.55
18
Retrieval-Augmented GenerationBSARD-G
Win Rate42.8
18
Retrieval-Augmented GenerationUltraDomain agriculture
Win Rate38
18
Showing 10 of 31 rows

Other info

Follow for update