Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation

About

Large language models (LLMs) have transformed various sectors, including education, finance, and medicine, by enhancing content generation and decision-making processes. However, their integration into the medical field is cautious due to hallucinations, instances where generated content deviates from factual accuracy, potentially leading to adverse outcomes. To address this, we introduce Hyper-RAG, a hypergraph-driven Retrieval-Augmented Generation method that comprehensively captures both pairwise and beyond-pairwise correlations in domain-specific knowledge, thereby mitigating hallucinations. Experiments on the NeurologyCrop dataset with six prominent LLMs demonstrated that Hyper-RAG improves accuracy by an average of 12.3% over direct LLM use and outperforms Graph RAG and Light RAG by 6.3% and 6.0%, respectively. Additionally, Hyper-RAG maintained stable performance with increasing query complexity, unlike existing methods which declined. Further validation across nine diverse datasets showed a 35.5% performance improvement over Light RAG using a selection-based assessment. The lightweight variant, Hyper-RAG-Lite, achieved twice the retrieval speed and a 3.3% performance boost compared with Light RAG. These results confirm Hyper-RAG's effectiveness in enhancing LLM reliability and reducing hallucinations, making it a robust solution for high-stakes applications like medical diagnostics.

Yifan Feng, Hao Hu, Xingliang Hou, Shiquan Liu, Shihui Ying, Shaoyi Du, Han Hu, Yue Gao• 2025

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA (test)
F154.9
198
Multi-hop Question AnsweringMuSiQue (test)
F132.6
111
Fact VerificationMINE
Accuracy81.73
28
Multi-hop QA2Wiki (test)
EM45.3
10
Simple QAPopQA (test)
EM49.1
10
Question AnsweringOverall Average (test)
EM50.9
10
Explanatory QAMix (test)
EM71.4
10
Explanatory QAPathology (test)
EM75.9
10
Retrieval-Augmented GenerationUltraDomain Mix (test)
Comprehensiveness90.5
9
Retrieval-Augmented GenerationUltraDomain Pathology (test)
Comprehension91.4
9
Showing 10 of 11 rows

Other info

Follow for update