Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation

About

The robustness of Text-to-SQL parsers against adversarial perturbations plays a crucial role in delivering highly reliable applications. Previous studies along this line primarily focused on perturbations in the natural language question side, neglecting the variability of tables. Motivated by this, we propose the Adversarial Table Perturbation (ATP) as a new attacking paradigm to measure the robustness of Text-to-SQL models. Following this proposition, we curate ADVETA, the first robustness evaluation benchmark featuring natural and realistic ATPs. All tested state-of-the-art models experience dramatic performance drops on ADVETA, revealing models' vulnerability in real-world practices. To defend against ATP, we build a systematic adversarial training example generation framework tailored for better contextualization of tabular data. Experiments show that our approach not only brings the best robustness improvement against table-side perturbations but also substantially empowers models against NL-side perturbations. We release our benchmark and code at: https://github.com/microsoft/ContextualSP.

Xinyu Pi, Bing Wang, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou• 2022

Related benchmarks

TaskDatasetResultRank
Text-to-SQLSpider (dev)--
100
Table Question AnsweringWikiTQ (test)
Accuracy49.3
92
Table Question AnsweringWikiSQL (test)
Accuracy85.9
55
Table Question AnsweringFeTaQA
S-BLEU33.4
12
Text-to-SQLSpider-Syn (dev)
Exact Match Accuracy60.4
11
Text-to-SQLSpider (ADVETA-RPL)
Exact Match (EM)35.8
10
Text-to-SQLSpider ADVETA-ADD
Exact Match (EM)50.6
10
Text-to-SQLWikiSQL ADVETA-RPL
Exact Match (EM)69.2
9
Text-to-SQLWikiSQL ADVETA-ADD
Exact Match (EM)79.9
9
Text-to-SQLWikiSQL original (dev)
Exact Match (EM)81.2
9
Showing 10 of 19 rows

Other info

Code

Follow for update