Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation

About

Large language models (LLMs) have transformed various sectors, including education, finance, and medicine, by enhancing content generation and decision-making processes. However, their integration into the medical field is cautious due to hallucinations, instances where generated content deviates from factual accuracy, potentially leading to adverse outcomes. To address this, we introduce Hyper-RAG, a hypergraph-driven Retrieval-Augmented Generation method that comprehensively captures both pairwise and beyond-pairwise correlations in domain-specific knowledge, thereby mitigating hallucinations. Experiments on the NeurologyCrop dataset with six prominent LLMs demonstrated that Hyper-RAG improves accuracy by an average of 12.3% over direct LLM use and outperforms Graph RAG and Light RAG by 6.3% and 6.0%, respectively. Additionally, Hyper-RAG maintained stable performance with increasing query complexity, unlike existing methods which declined. Further validation across nine diverse datasets showed a 35.5% performance improvement over Light RAG using a selection-based assessment. The lightweight variant, Hyper-RAG-Lite, achieved twice the retrieval speed and a 3.3% performance boost compared with Light RAG. These results confirm Hyper-RAG's effectiveness in enhancing LLM reliability and reducing hallucinations, making it a robust solution for high-stakes applications like medical diagnostics.

Yifan Feng, Hao Hu, Xingliang Hou, Shiquan Liu, Shihui Ying, Shaoyi Du, Han Hu, Yue Gao• 2025

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	HotpotQA (test)	F154.9	334
Multi-hop Question Answering	MuSiQue (test)	F132.6	151
Multi-hop Question Answering	HotpotQA (dev)	Answer F171.074	72
Multi-hop Question Answering	Musique (dev)	F1 Score35.632	29
Fact Verification	MINE	Accuracy81.73	28
Complex Reasoning	GraphRAG-Bench	Rel72.11	27
Retrieval	MisstepMath	Cosine Similarity62.46	16
Multi-hop QA	2Wiki (test)	EM45.3	10
Question Answering	UltraDomain Mix 1.0 (512 question-answer pairs)	Exact Match (EM)39.25	10
Question Answering	UltraDomain Computer Science 1.0	EM25.97	10

Showing 10 of 43 rows

Other info

Follow for update

@wizwand_team Discord