Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rethinking Tabular Data Understanding with Large Language Models

About

Large Language Models (LLMs) have shown to be capable of various tasks, yet their capability in interpreting and reasoning over tabular data remains an underexplored area. In this context, this study investigates from three core perspectives: the robustness of LLMs to structural perturbations in tables, the comparative analysis of textual and symbolic reasoning on tables, and the potential of boosting model performance through the aggregation of multiple reasoning pathways. We discover that structural variance of tables presenting the same content reveals a notable performance decline, particularly in symbolic reasoning tasks. This prompts the proposal of a method for table structure normalization. Moreover, textual reasoning slightly edges out symbolic reasoning, and a detailed error analysis reveals that each exhibits different strengths depending on the specific tasks. Notably, the aggregation of textual and symbolic reasoning pathways, bolstered by a mix self-consistency mechanism, resulted in achieving SOTA performance, with an accuracy of 73.6% on WIKITABLEQUESTIONS, representing a substantial advancement over previous existing table processing paradigms of LLMs.

Tianyang Liu, Fei Wang, Muhao Chen• 2023

Related benchmarks

TaskDatasetResultRank
Table Question AnsweringWikiTQ (test)
Accuracy56.87
130
Table Question AnsweringWikiTQ
Accuracy73.65
118
Table Fact VerificationTabFact
Accuracy0.885
104
Table Question AnsweringWTQ
Accuracy72.5
101
Table Question AnsweringWikiTableQuestions (test)
Accuracy73.6
86
Table Question AnsweringTableBench
EM61.9
40
Table Question AnsweringPenguins in a Table
EM92.4
40
Table Question AnsweringWikiTable Questions (WTQ)
Accuracy73.7
28
Table Question AnsweringWikiTableQuestions (WTQ) T (original tables) (sampled subset)
Accuracy73.65
18
Table-based Question AnsweringWikiTableQuestions 17 (test)
Accuracy73.6
12
Showing 10 of 14 rows

Other info

Follow for update