Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation

About

Recent advancements in large language models (LLMs) have shown impressive versatility across various tasks. To eliminate their hallucinations, retrieval-augmented generation (RAG) has emerged as a powerful approach, leveraging external knowledge sources like knowledge graphs (KGs). In this paper, we study the task of KG-driven RAG and propose a novel Similar Graph Enhanced Retrieval-Augmented Generation (SimGRAG) method. It effectively addresses the challenge of aligning query texts and KG structures through a two-stage process: (1) query-to-pattern, which uses an LLM to transform queries into a desired graph pattern, and (2) pattern-to-subgraph, which quantifies the alignment between the pattern and candidate subgraphs using a graph semantic distance (GSD) metric. We also develop an optimized retrieval algorithm that efficiently identifies the top-k subgraphs within 1-second on a 10-million-scale KG. Extensive experiments show that SimGRAG outperforms state-of-the-art KG-driven RAG methods in both question answering and fact verification. Our code is available at https://github.com/YZ-Cai/SimGRAG.

Yuzheng Cai, Zhenyue Guo, Yiwen Pei, Wanrui Bian, Weiguo Zheng• 2024

Related benchmarks

TaskDatasetResultRank
Question AnsweringMetaQA 3-hop
Hits@197.8
38
Knowledge Base Question AnsweringMetaQA 1hop
Hits@198
28
Claim VerificationFactKG (test)
Average Accuracy86.8
20
Knowledge Graph Question AnsweringMetaQA 2-hop (test)
Hits@198.4
20
Knowledge Graph Question AnsweringPathQuestions (PQ) 2-hop (test)
Hits@188.7
4
Knowledge Graph Question AnsweringPathQuestions (PQ) 3-hop (test)
Hits@178.6
4
Knowledge Graph Question AnsweringWorldCup 2014
Hits@198.1
4
Showing 7 of 7 rows

Other info

Code

Follow for update