Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation

About

The robustness of Text-to-SQL parsers against adversarial perturbations plays a crucial role in delivering highly reliable applications. Previous studies along this line primarily focused on perturbations in the natural language question side, neglecting the variability of tables. Motivated by this, we propose the Adversarial Table Perturbation (ATP) as a new attacking paradigm to measure the robustness of Text-to-SQL models. Following this proposition, we curate ADVETA, the first robustness evaluation benchmark featuring natural and realistic ATPs. All tested state-of-the-art models experience dramatic performance drops on ADVETA, revealing models' vulnerability in real-world practices. To defend against ATP, we build a systematic adversarial training example generation framework tailored for better contextualization of tabular data. Experiments show that our approach not only brings the best robustness improvement against table-side perturbations but also substantially empowers models against NL-side perturbations. We release our benchmark and code at: https://github.com/microsoft/ContextualSP.

Xinyu Pi, Bing Wang, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou • 2022
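
The table-side perturbations described in the abstract can be illustrated with a small sketch. It reads the RPL and ADD split names listed under Related benchmarks as "replace a column name" and "add a column" respectively. The `Table` class, the helper functions, and the example column names are all hypothetical and are not taken from the ADVETA code base; the released benchmark builds its perturbations with a more careful, natural-and-realistic generation procedure than the hand-written dictionaries used here.

```python
from dataclasses import dataclass


@dataclass
class Table:
    """Hypothetical minimal schema: a table name plus its column names."""
    name: str
    columns: list[str]


def replace_columns(table: Table, synonyms: dict[str, str]) -> Table:
    """RPL-style sketch: swap column names for equivalent alternatives."""
    return Table(table.name, [synonyms.get(c, c) for c in table.columns])


def add_columns(table: Table, distractors: list[str]) -> Table:
    """ADD-style sketch: append extra columns, skipping ones already present."""
    extra = [c for c in distractors if c not in table.columns]
    return Table(table.name, table.columns + extra)


# Illustrative schema and perturbations (assumed, not from the benchmark).
original = Table("singer", ["name", "country", "age"])
rpl = replace_columns(original, {"country": "nation", "age": "years old"})
add = add_columns(original, ["citizenship", "birth year"])

print(rpl.columns)
print(add.columns)
```

In both cases the natural language question stays unchanged. A parser that links question words to column names by surface overlap may fail on the replaced names or be distracted by the added ones, which is the kind of vulnerability the benchmark is meant to expose.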

Related benchmarks

Task	Dataset	Result
Text-to-SQL	Spider (dev)	--	147
Table Question Answering	WikiTQ (test)	Accuracy 49.3	140
Table Question Answering	WikiSQL (test)	Accuracy 85.9	55
Table Question Answering	FeTaQA	S-BLEU 33.4	12
Text-to-SQL	Spider-Syn (dev)	Exact Match Accuracy 60.4	11
Text-to-SQL	Spider (ADVETA-RPL)	Exact Match (EM) 35.8	10
Text-to-SQL	Spider ADVETA-ADD	Exact Match (EM) 50.6	10
Text-to-SQL	WikiSQL ADVETA-RPL	Exact Match (EM) 69.2	9
Text-to-SQL	WikiSQL ADVETA-ADD	Exact Match (EM) 79.9	9
Text-to-SQL	WikiSQL original (dev)	Exact Match (EM) 81.2	9
