Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation

About

We present a neural approach called IRNet for complex and cross-domain Text-to-SQL. IRNet aims to address two challenges: 1) the mismatch between intents expressed in natural language (NL) and the implementation details in SQL; 2) the challenge in predicting columns caused by the large number of out-of-domain words. Instead of end-to-end synthesizing a SQL query, IRNet decomposes the synthesis process into three phases. In the first phase, IRNet performs a schema linking over a question and a database schema. Then, IRNet adopts a grammar-based neural model to synthesize a SemQL query which is an intermediate representation that we design to bridge NL and SQL. Finally, IRNet deterministically infers a SQL query from the synthesized SemQL query with domain knowledge. On the challenging Text-to-SQL benchmark Spider, IRNet achieves 46.7% accuracy, obtaining 19.5% absolute improvement over previous state-of-the-art approaches. At the time of writing, IRNet achieves the first position on the Spider leaderboard.

Jiaqi Guo, Zecheng Zhan, Yan Gao, Yan Xiao, Jian-Guang Lou, Ting Liu, Dongmei Zhang• 2019

Related benchmarks

TaskDatasetResultRank
Text-to-SQLSpider (test)
Execution Accuracy55
140
Text-to-SQLSpider (dev)--
100
Text-to-SQLSpider 1.0 (dev)
Exact Match Accuracy63.9
92
Text-to-SQLSpider 1.0 (test)
EM Acc (Overall)55
91
Text-to-SQLSpider-SYN 1.0 (val)
EM Accuracy28.4
10
Text-to-SQLSpider-DK 1.0 (val)
EM33.1
10
Text-to-SQLSpider-Syn (test)
QM (%)28.4
6
Showing 7 of 7 rows

Other info

Follow for update