HugRAG: Hierarchical Causal Knowledge Graph Design for RAG
About
Retrieval augmented generation (RAG) has enhanced large language models by enabling access to external knowledge, with graph-based RAG emerging as a powerful paradigm for structured retrieval and reasoning. However, existing graph-based methods often over-rely on surface-level node matching and lack explicit causal modeling, leading to unfaithful or spurious answers. Prior attempts to incorporate causality are typically limited to local or single-document contexts and also suffer from information isolation that arises from modular graph structures, which hinders scalability and cross-module causal reasoning. To address these challenges, we propose HugRAG, a framework that rethinks knowledge organization for graph-based RAG through causal gating across hierarchical modules. HugRAG explicitly models causal relationships to suppress spurious correlations while enabling scalable reasoning over large-scale knowledge graphs. Extensive experiments demonstrate that HugRAG consistently outperforms competitive graph-based RAG baselines across multiple datasets and evaluation metrics. Our work establishes a principled foundation for structured, scalable, and causally grounded RAG systems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Question Answering | 2Wiki | F131.97 | 152 | |
| Question Answering | HolisQA Medicine | F1 Score36.45 | 10 | |
| Question Answering | HolisQA Computer Science | F131.6 | 10 | |
| Question Answering | HolisQA Business | F1 Score51.51 | 10 | |
| Question Answering | HolisQA Biology | F1 Score34.8 | 10 | |
| Question Answering | HolisQA Psychology | F1 Score44.42 | 10 | |
| Question Answering | MSMARCO | F1 Score38.4 | 10 | |
| Question Answering | NQ | F1 Score49.5 | 10 | |
| Question Answering | HotpotQA | F1 Score64.83 | 10 | |
| Question Answering | QASC | F113.35 | 10 |