HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation
About
Retrieval-Augmented Generation (RAG) systems often struggle with imperfect retrieval, as traditional retrievers focus on lexical or semantic similarity rather than logical relevance. To address this, we propose \textbf{HopRAG}, a novel RAG framework that augments retrieval with logical reasoning through graph-structured knowledge exploration. During indexing, HopRAG constructs a passage graph, with text chunks as vertices and logical connections established via LLM-generated pseudo-queries as edges. During retrieval, it employs a \textit{retrieve-reason-prune} mechanism: starting with lexically or semantically similar passages, the system explores multi-hop neighbors guided by pseudo-queries and LLM reasoning to identify truly relevant ones. Experiments on multiple multi-hop benchmarks demonstrate that HopRAG's \textit{retrieve-reason-prune} mechanism can expand the retrieval scope based on logical connections and improve final answer quality.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-hop Question Answering | 2WikiMultihopQA | EM61.1 | 278 | |
| Retrieval | Natural Questions (test) | Top-5 Recall74.4 | 62 | |
| Multi-hop QA | MuSiQue | EM42.2 | 42 | |
| Multi-hop QA | HotpotQA | Exact Match62 | 33 | |
| Multi-hop QA Retrieval | MuSiQue (test) | R@253.7 | 28 | |
| Multi-hop QA Retrieval | 2WikiMultiHopQA (test) | R@261.7 | 28 | |
| Multi-hop QA | 2Wiki | EM0.611 | 26 | |
| Single-hop QA | NQ (Natural Questions) | EM42.9 | 22 | |
| Retrieval | HotpotQA (test) | Recall@281.1 | 20 | |
| Single-hop QA Retrieval | NaturalQuestions (NQ) (test) | R@243.9 | 8 |