Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Understanding tables with intermediate pre-training

About

Table entailment, the binary classification task of finding if a sentence is supported or refuted by the content of a table, requires parsing language and table structure as well as numerical and discrete reasoning. While there is extensive work on textual entailment, table entailment is less well studied. We adapt TAPAS (Herzig et al., 2020), a table-based BERT model, to recognize entailment. Motivated by the benefits of data augmentation, we create a balanced dataset of millions of automatically created training examples which are learned in an intermediate step prior to fine-tuning. This new data is not only useful for table entailment, but also for SQA (Iyyer et al., 2017), a sequential table QA task. To be able to use long examples as input of BERT models, we evaluate table pruning techniques as a pre-processing step to drastically improve the training and prediction efficiency at a moderate drop in accuracy. The different methods set the new state-of-the-art on the TabFact (Chen et al., 2020) and SQA datasets.

Julian Martin Eisenschlos, Syrine Krichene, Thomas M\"uller• 2020

Related benchmarks

TaskDatasetResultRank
Table Fact VerificationTabFact (test)
Accuracy81
98
Table Fact VerificationTabFact small (test)
Accuracy0.839
57
Table Fact VerificationTABFACT simple (test)
Accuracy92.3
39
Table Fact VerificationTABFACT complex (test)
Accuracy75.6
39
Sequential Question AnsweringSQA (test)
Accuracy (All)71
33
Table Fact VerificationTabFact (dev)
Accuracy81
28
Table Fact VerificationTABFACT (val)
Accuracy81
21
Binary ClassificationTabFact (test)
Accuracy81
18
Binary ClassificationTABFACT simple (test)
Accuracy92.3
17
Binary ClassificationTABFACT complex (test)
Accuracy75.6
17
Showing 10 of 16 rows

Other info

Code

Follow for update