Synthesizing Text-to-SQL Data from Weak and Strong LLMs

About

The capability gap between open-source and closed-source large language models (LLMs) remains a challenge in text-to-SQL tasks. In this paper, we introduce a synthetic data approach that combines data produced by larger, more powerful models (strong models) with error information data generated by smaller, not well-aligned models (weak models). The method not only enhances the domain generalization of text-to-SQL models but also explores the potential of error data supervision through preference learning. Furthermore, we employ the synthetic data approach for instruction tuning on open-source LLMs, resulting SENSE, a specialized text-to-SQL model. The effectiveness of SENSE is demonstrated through state-of-the-art results on the SPIDER and BIRD benchmarks, bridging the performance gap between open-source models and methods prompted by closed-source models.

Jiaxi Yang, Binyuan Hui, Min Yang, Jian Yang, Junyang Lin, Chang Zhou• 2024

Related benchmarks

Task	Dataset	Result
Text-to-SQL	BIRD (dev)	Execution Accuracy (EA)55.5	387
Text-to-SQL	Spider (test)	Execution Accuracy86.6	213
Text-to-SQL	Spider (dev)	EX84.1	147
Text-to-SQL	Spider-DK	Execution Accuracy (EX)80.2	95
Text-to-SQL	Spider-Syn	Execution Accuracy (EX)77.6	79
Text-to-SQL	Spider-Realistic	Execution Accuracy (EX)84.1	47
Text-to-SQL	Spider Robustness Suite SYN REALISTIC DK (dev)	Execution Accuracy (SYN)77.6	6

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord