Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rethinking Table Instruction Tuning

About

Recent advances in table understanding have focused on instruction-tuning large language models (LLMs) for table-related tasks. However, existing research has overlooked the impact of hyperparameter choices, and also lacks a comprehensive evaluation of the out-of-domain table understanding ability and the general capabilities of these table LLMs. In this paper, we evaluate these abilities in existing table LLMs, and find significant declines in both out-of-domain table understanding and general capabilities as compared to their base models. Through systematic analysis, we show that hyperparameters, such as learning rate, can significantly influence both table-specific and general capabilities. Contrary to the previous table instruction-tuning work, we demonstrate that smaller learning rates and fewer training instances can enhance table understanding while preserving general capabilities. Based on our findings, we introduce TAMA, a TAble LLM instruction-tuned from LLaMA 3.1 8B Instruct, which achieves performance on par with, or surpassing GPT-3.5 and GPT-4 on table tasks, while maintaining strong out-of-domain generalization and general capabilities. Our findings highlight the potential for reduced data annotation costs and more efficient model development through careful hyperparameter selection. We open-source the project and our models.

Naihao Deng, Rada Mihalcea• 2025

Related benchmarks

TaskDatasetResultRank
Table Fact VerificationTabFact (test)
Accuracy65.09
98
Table Question AnsweringWikiTableQuestions (test)
Accuracy52.88
86
Question AnsweringGPQA (test)
Accuracy31.92
55
Table Question AnsweringWikiSQL (test)
Accuracy68.31
55
Table Fact VerificationTabFact (dev)
Accuracy73.82
28
Table Question AnsweringTAT-QA (test)
Accuracy48.47
15
Table UnderstandingMMTU (test)
Overall Performance33.9
12
Table Fact VerificationFEVEROUS (dev)
Accuracy77.39
8
Science Question AnsweringAI2ARC (test)
Accuracy81.23
6
Table Fact VerificationFeT (dev)
Accuracy35.37
4
Showing 10 of 21 rows

Other info

Follow for update