ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models

About

Knowledge Base Question Answering (KBQA) aims to answer natural language questions over large-scale knowledge bases (KBs), which can be summarized into two crucial steps: knowledge retrieval and semantic parsing. However, three core challenges remain: inefficient knowledge retrieval, mistakes of retrieval adversely impacting semantic parsing, and the complexity of previous KBQA methods. To tackle these challenges, we introduce ChatKBQA, a novel and simple generate-then-retrieve KBQA framework, which proposes first generating the logical form with fine-tuned LLMs, then retrieving and replacing entities and relations with an unsupervised retrieval method, to improve both generation and retrieval more directly. Experimental results show that ChatKBQA achieves new state-of-the-art performance on standard KBQA datasets, WebQSP, and CWQ. This work can also be regarded as a new paradigm for combining LLMs with knowledge graphs (KGs) for interpretable and knowledge-required question answering. Our code is publicly available.

Haoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin, Yifan Zhu, Luu Anh Tuan• 2023

Related benchmarks

Task	Dataset	Result
Knowledge Base Question Answering	WEBQSP (test)	Hit@186.4	145
Knowledge Base Question Answering	WebQSP Freebase (test)	Hits@186.4	60
Knowledge Base Question Answering	CWQ (test)	F1 Score81.3	44
Knowledge Base Question Answering	CWQ Freebase (test)	Hits@186	38
Knowledge Base Question Answering	CWQ	Hits@176	30
Knowledge Graph Question Answering	WebQSP	Hits@186.33	9
Knowledge Base Question Answering	WebQSP	Exact Match (EM)77	2
Knowledge Graph Question Answering	CWQ Freebase	F1 Score70.3	2
Knowledge Graph Question Answering	WQSP Freebase	F1 Score74	2

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord