In-depth Analysis of Graph-based RAG in a Unified Framework
About
Graph-based Retrieval-Augmented Generation (RAG) has proven effective in integrating external knowledge into large language models (LLMs), improving their factual accuracy, adaptability, interpretability, and trustworthiness. A number of graph-based RAG methods have been proposed in the literature. However, these methods have not been systematically and comprehensively compared under the same experimental settings. In this paper, we first summarize a unified framework to incorporate all graph-based RAG methods from a high-level perspective. We then extensively compare representative graph-based RAG methods over a range of questing-answering (QA) datasets -- from specific questions to abstract questions -- and examine the effectiveness of all methods, providing a thorough analysis of graph-based RAG approaches. As a byproduct of our experimental analysis, we are also able to identify new variants of the graph-based RAG methods over specific QA and abstract QA tasks respectively, by combining existing techniques, which outperform the state-of-the-art methods. Finally, based on these findings, we offer promising research opportunities. We believe that a deeper understanding of the behavior of existing methods can provide new valuable insights for future research.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Attributed Question Answering | ALCE | STRREC Score41.023 | 24 | |
| Multi-hop Question Answering | MultihopQA | Accuracy59.664 | 24 | |
| Multiple-choice Question Answering | QUALITY | -- | 19 | |
| Question Answering | MusiqueQA | Accuracy26.933 | 16 | |
| Question Answering | PopQA | -- | 13 | |
| Question Answering | HotpotQA | -- | 10 | |
| Multi-hop Question Answering | MusiqueQA | Accuracy26.933 | 8 |