Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LinearRAG: Linear Graph Retrieval Augmented Generation on Large-scale Corpora

About

Retrieval-Augmented Generation (RAG) is widely used to mitigate hallucinations of Large Language Models (LLMs) by leveraging external knowledge. While effective for simple queries, traditional RAG systems struggle with large-scale, unstructured corpora where information is fragmented. Recent advances incorporate knowledge graphs to capture relational structures, enabling more comprehensive retrieval for complex, multi-hop reasoning tasks. However, existing graph-based RAG (GraphRAG) methods rely on unstable and costly relation extraction for graph construction, often producing noisy graphs with incorrect or inconsistent relations that degrade retrieval quality. In this paper, we revisit the pipeline of existing GraphRAG systems and propose LinearRAG (Linear Graph-based Retrieval-Augmented Generation), an efficient framework that enables reliable graph construction and precise passage retrieval. Specifically, LinearRAG constructs a relation-free hierarchical graph, termed Tri-Graph, using only lightweight entity extraction and semantic linking, avoiding unstable relation modeling. This new paradigm of graph construction scales linearly with corpus size and incurs no extra token consumption, providing an economical and reliable indexing of the original passages. For retrieval, LinearRAG adopts a two-stage strategy: (i) relevant entity activation via local semantic bridging, followed by (ii) passage retrieval through global importance aggregation. Extensive experiments on four datasets demonstrate that LinearRAG significantly outperforms baseline models. Our code and datasets are available at https://github.com/DEEP-PolyU/LinearRAG.

Luyao Zhuang, Shengyuan Chen, Yilin Xiao, Huachi Zhou, Yujing Zhang, Hao Chen, Qinggang Zhang, Xiao Huang• 2025

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA (test)
F159
198
Multi-hop Question Answering2WikiMultiHopQA (test)--
143
Multi-hop Question AnsweringMuSiQue (test)
F131
111
Question AnsweringNQ (test)--
66
Multi-hop QAMuSiQue
EM18.7
42
Multi-hop QAHotpotQA
Exact Match39.6
33
Multi-hop QA2Wiki
EM42.4
26
Question Answering2WikiMultihopQA
LLM-Acc87.2
20
Question AnsweringMuSiQue
LLM Accuracy62.4
20
Question AnsweringHotpotQA
LLM Accuracy86.2
20
Showing 10 of 21 rows

Other info

Follow for update