On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries

About

Large-scale semantic parsing datasets annotated with logical forms have enabled major advances in supervised approaches. But can richer supervision help even more? To explore the utility of fine-grained, lexical-level supervision, we introduce Squall, a dataset that enriches 11,276 WikiTableQuestions English-language questions with manually created SQL equivalents plus alignments between SQL and question fragments. Our annotation enables new training possibilities for encoder-decoder models, including approaches from machine translation previously precluded by the absence of alignments. We propose and test two methods: (1) supervised attention; (2) adopting an auxiliary objective of disambiguating references in the input queries to table columns. In 5-fold cross validation, these strategies improve over strong baselines by 4.4% execution accuracy. Oracle experiments suggest that annotated alignments can support further accuracy gains of up to 23.9%.

Tianze Shi, Chen Zhao, Jordan Boyd-Graber, Hal Daum\'e III, Lillian Lee• 2020

Related benchmarks

Task	Dataset	Result
Text-to-SQL	WTQ original (dev)	Exact Match (EM)44.1	8
Text-to-SQL	WTQ (ADVETA-RPL)	Exact Match (EM)22.8	8
Text-to-SQL	WTQ (ADVETA-ADD)	Exact Match (EM)32.9	8

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord