HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation

About

Retrieval-Augmented Generation (RAG) systems often struggle with imperfect retrieval, as traditional retrievers focus on lexical or semantic similarity rather than logical relevance. To address this, we propose \textbf{HopRAG}, a novel RAG framework that augments retrieval with logical reasoning through graph-structured knowledge exploration. During indexing, HopRAG constructs a passage graph, with text chunks as vertices and logical connections established via LLM-generated pseudo-queries as edges. During retrieval, it employs a \textit{retrieve-reason-prune} mechanism: starting with lexically or semantically similar passages, the system explores multi-hop neighbors guided by pseudo-queries and LLM reasoning to identify truly relevant ones. Experiments on multiple multi-hop benchmarks demonstrate that HopRAG's \textit{retrieve-reason-prune} mechanism can expand the retrieval scope based on logical connections and improve final answer quality.

Hao Liu, Zhengren Wang, Xi Chen, Zhiyu Li, Feiyu Xiong, Qinhan Yu, Wentao Zhang• 2025

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	2WikiMultihopQA	EM61.1	559
Multi-hop QA	HotpotQA	Exact Match62	143
Multi-hop QA	MuSiQue	EM42.2	95
Retrieval	Natural Questions (test)	Top-5 Recall74.4	62
Single-hop QA	NQ (Natural Questions)	EM42.9	52
Multi-hop QA	2Wiki	EM0.611	42
Multi-hop QA Retrieval	MuSiQue (test)	R@566.8	33
Multi-hop QA Retrieval	2WikiMultiHopQA (test)	R@570.1	33
Document Question Answering	M3DocVQA	Exact Match22.8	24
Multi-hop document retrieval	HotpotQA (test)	--	24

Showing 10 of 20 rows

Other info

Follow for update

@wizwand_team Discord