Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models

About

Large Language Models (LLMs) may hallucinate in real-world applications when they lack relevant knowledge. Knowledge graphs, in contrast, are large multi-relational structures that store a vast number of symbolic facts. Integrating LLMs with knowledge graphs has therefore been explored extensively, with Knowledge Graph Question Answering (KGQA) serving as a key testbed for the integration. This task requires LLMs to answer natural language questions by retrieving relevant triples from knowledge graphs. However, existing methods face two significant challenges: excessively long reasoning paths that distract from answer generation, and false-positive relations that hinder path refinement. In this paper, we propose an iterative, interactive KGQA framework that leverages the interactive learning capabilities of LLMs to perform reasoning and Debating over Graphs (DoG). Specifically, DoG employs a subgraph-focusing mechanism that lets the LLM attempt an answer after each reasoning step, thereby mitigating the impact of lengthy reasoning paths. In addition, DoG uses a multi-role debate team to gradually simplify complex questions, reducing the influence of false-positive relations. This debate mechanism ensures the reliability of the reasoning process. Experimental results on five public datasets demonstrate the effectiveness and superiority of our architecture. Notably, DoG outperforms the state-of-the-art method ToG by 23.7% and 9.1% in accuracy on WebQuestions and GrailQA, respectively. Furthermore, integration experiments with various LLMs on these datasets highlight the flexibility of DoG. Code is available at https://github.com/reml-group/DoG.
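
The control flow described in the abstract (reason one hop at a time, and attempt an answer after every hop using only the most recent triple) can be illustrated with a toy sketch. This is not the authors' implementation: the graph data, the function names, and the fixed relation plan that stands in for the LLM and the debate team are all hypothetical, chosen only to make the loop runnable.

```python
from typing import Callable, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (head, relation, tail)

# Toy knowledge graph; illustrative data only, not taken from the paper.
TOY_KG: List[Triple] = [
    ("Inception", "directed_by", "Christopher Nolan"),
    ("Inception", "released_in", "2010"),
    ("Christopher Nolan", "born_in", "London"),
    ("Christopher Nolan", "spouse", "Emma Thomas"),
]


def neighbors(kg: List[Triple], entity: str) -> List[Triple]:
    """Return every triple whose head is the given entity."""
    return [t for t in kg if t[0] == entity]


def iterative_kgqa(
    kg: List[Triple],
    topic_entity: str,
    select_triple: Callable[[List[Triple], List[Triple]], Optional[Triple]],
    try_answer: Callable[[Triple, List[Triple]], Optional[str]],
    max_steps: int = 3,
) -> Tuple[Optional[str], List[Triple]]:
    """Walk the graph hop by hop, attempting an answer after each hop."""
    entity, path = topic_entity, []
    for _ in range(max_steps):
        candidates = neighbors(kg, entity)
        # Hypothetical stand-in for the LLM step that filters relations.
        triple = select_triple(candidates, path)
        if triple is None:
            break
        path.append(triple)
        # Answer attempt focused on the latest triple, not the whole path.
        answer = try_answer(triple, path)
        if answer is not None:
            return answer, path
        entity = triple[2]  # continue reasoning from the tail entity
    return None, path


# Assumed decomposition of "Where was the director of Inception born?"
PLAN = ["directed_by", "born_in"]


def select_triple(candidates: List[Triple], path: List[Triple]) -> Optional[Triple]:
    """Pick the candidate whose relation matches the next planned relation."""
    if len(path) >= len(PLAN):
        return None
    wanted = PLAN[len(path)]
    return next((t for t in candidates if t[1] == wanted), None)


def try_answer(triple: Triple, path: List[Triple]) -> Optional[str]:
    """Answer only once every planned relation has been resolved."""
    return triple[2] if len(path) == len(PLAN) else None


answer, path = iterative_kgqa(TOY_KG, "Inception", select_triple, try_answer)
print(answer, path)
```

In a real system the two callbacks would be LLM calls; here they are deterministic so that the early-exit behavior of the loop is easy to inspect.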

Jie Ma, Zhitao Gao, Qi Chai, Wangchun Sun, Pinghui Wang, Hongbin Pei, Jing Tao, Lingyun Song, Jun Liu, Chen Zhang, Lizhen Cui • 2024

Related benchmarks

Task	Dataset	Result
Knowledge Graph Question Answering	CWQ	Hit@1 58.4	212
Knowledge Graph Question Answering	WebQSP	Hit@1 91.2	174
Knowledge Graph Question Answering	CWQ (test)	Hits@1 41	125
Knowledge Graph Question Answering	WebQSP (test)	Hit 65.4	85
Multi-hop Knowledge Graph Question Answering	WebQSP	Hits@1 91	69
Multi-hop Knowledge Graph Question Answering	GrailQA	Hits@1 80	68
Multi-hop Knowledge Graph Question Answering	CWQ	Hits@1 56	64
Knowledge Base Question Answering	WebQSP Freebase (test)	--	60
Knowledge Base Question Answering	WebQSP	--	53
Knowledge Base Question Answering	GrailQA Freebase (test)	Hits@1 80	48

Showing 10 of 16 rows
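
Most rows above report Hits@1 (also written Hit@1). The page does not define the metric, so the sketch below assumes the common definition: the percentage of questions whose top-ranked prediction appears in the gold answer set. The function name and the sample data are hypothetical, and individual papers may normalize answer strings differently before matching.

```python
from typing import List, Set


def hits_at_1(predictions: List[List[str]], gold_answers: List[Set[str]]) -> float:
    """Percentage of questions whose first prediction is a gold answer."""
    if len(predictions) != len(gold_answers):
        raise ValueError("predictions and gold_answers must have equal length")
    if not predictions:
        return 0.0
    hits = sum(
        1
        for ranked, gold in zip(predictions, gold_answers)
        if ranked and ranked[0] in gold  # an empty prediction list is a miss
    )
    return 100.0 * hits / len(predictions)


# Illustrative data only: four questions, two answered correctly at rank 1.
sample_predictions = [["London", "Paris"], ["2010"], ["Berlin"], []]
sample_gold = [{"London"}, {"2011"}, {"Berlin", "Bonn"}, {"Rome"}]
score = hits_at_1(sample_predictions, sample_gold)
print(score)  # 50.0
```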
