StructGPT: A General Framework for Large Language Model to Reason over Structured Data

About

In this paper, we study how to improve the zero-shot reasoning ability of large language models~(LLMs) over structured data in a unified way. Inspired by the study on tool augmentation for LLMs, we develop an \emph{Iterative Reading-then-Reasoning~(IRR)} approach for solving question answering tasks based on structured data, called \textbf{StructGPT}. In our approach, we construct the specialized function to collect relevant evidence from structured data (\ie \emph{reading}), and let LLMs concentrate the reasoning task based on the collected information (\ie \emph{reasoning}). Specially, we propose an \emph{invoking-linearization-generation} procedure to support LLMs in reasoning on the structured data with the help of the external interfaces. By iterating this procedures with provided interfaces, our approach can gradually approach the target answer to a given query. Extensive experiments conducted on three types of structured data demonstrate the effectiveness of our approach, which can significantly boost the performance of ChatGPT and achieve comparable performance against the full-data supervised-tuning baselines. Our codes and data are publicly available at~\url{https://github.com/RUCAIBox/StructGPT}.

Jinhao Jiang, Kun Zhou, Zican Dong, Keming Ye, Wayne Xin Zhao, Ji-Rong Wen• 2023

Related benchmarks

Task	Dataset	Result
Question Answering	ARC Challenge	Accuracy83.28	906
Multi-task Language Understanding	MMLU	Accuracy61.4	881
Language Understanding	MMLU	Accuracy83.41	844
Reasoning	BBH	Accuracy71.93	726
Question Answering	ARC Easy	Normalized Acc86.87	391
Reading Comprehension	RACE high	Accuracy79.87	295
Knowledge Graph Question Answering	CWQ	Hit@154.3	212
Reading Comprehension	RACE mid	Accuracy81.27	196
Knowledge Graph Question Answering	WebQSP	Hit@172.6	174
Question Answering	CommonsenseQA	Accuracy71.58	150

Showing 10 of 39 rows

Other info

Follow for update

@wizwand_team Discord