Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

About

Although large language models (LLMs) have achieved significant success in various tasks, they often struggle with hallucination problems, especially in scenarios requiring deep and responsible reasoning. These issues could be partially addressed by introducing external knowledge graphs (KG) in LLM reasoning. In this paper, we propose a new LLM-KG integrating paradigm ``$\hbox{LLM}\otimes\hbox{KG}$'' which treats the LLM as an agent to interactively explore related entities and relations on KGs and perform reasoning based on the retrieved knowledge. We further implement this paradigm by introducing a new approach called Think-on-Graph (ToG), in which the LLM agent iteratively executes beam search on KG, discovers the most promising reasoning paths, and returns the most likely reasoning results. We use a number of well-designed experiments to examine and illustrate the following advantages of ToG: 1) compared with LLMs, ToG has better deep reasoning power; 2) ToG has the ability of knowledge traceability and knowledge correctability by leveraging LLMs reasoning and expert feedback; 3) ToG provides a flexible plug-and-play framework for different LLMs, KGs and prompting strategies without any additional training cost; 4) the performance of ToG with small LLM models could exceed large LLM such as GPT-4 in certain scenarios and this reduces the cost of LLM deployment and application. As a training-free method with lower computational cost and better generality, ToG achieves overall SOTA in 6 out of 9 datasets where most previous SOTAs rely on additional training.

Jiashuo Sun, Chengjin Xu, Lumingyuan Tang, Saizhuo Wang, Chen Lin, Yeyun Gong, Lionel M. Ni, Heung-Yeung Shum, Jian Guo• 2023

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	2WikiMultihopQA	EM5.2	559
Multi-hop Question Answering	HotpotQA (test)	F129.2	311
Multi-hop Question Answering	HotpotQA	F1 Score40.07	294
Multi-hop Question Answering	2WikiMultiHopQA (test)	EM32.8	226
Knowledge Graph Question Answering	CWQ	Hit@169.5	212
Knowledge Graph Question Answering	WebQSP	Hit@182.6	174
Knowledge Base Question Answering	WEBQSP (test)	Hit@182.6	145
Knowledge Graph Question Answering	CWQ (test)	Hits@170.87	125
Knowledge Graph Question Answering	WEBQSP (test)	Hit81.9	85
Multi-hop Question Answering	HotpotQA	F111	79

Showing 10 of 111 rows

...

Other info

Follow for update

@wizwand_team Discord