MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL
About
The open-domain text-to-SQL task aims to retrieve question-relevant tables from massive databases and then generate SQL. However, the performance of current methods is constrained by single-hop retrieval, and existing multi-hop retrieval methods from open-domain question answering are not directly applicable, because they tend to retrieve tables that are similar to those already retrieved but irrelevant to the question. This happens because questions in text-to-SQL usually contain all the required information, whereas previous multi-hop retrieval supplements the question with retrieved documents. We therefore propose multi-hop table retrieval with removal (MURRE), which removes previously retrieved information from the question to guide the retriever towards relevant tables that have not yet been retrieved. Our experiments on two open-domain text-to-SQL datasets demonstrate an average improvement of 5.7% over the previous state-of-the-art results.
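The removal idea can be illustrated with a toy sketch. This is not the MURRE implementation: the keyword-overlap scorer, the table descriptions, and the function names `single_hop` and `retrieve_with_removal` are all assumptions made for illustration, and "removal" is approximated here by deleting question words that the retrieved table already covers.

```python
import re

def tokens(text):
    """Lowercased word set; a stand-in for a real retriever's representation."""
    return set(re.findall(r"[a-z]+", text.lower()))

def score(query_tokens, table_tokens):
    """Toy relevance score: number of shared words (assumption, not a dense retriever)."""
    return len(query_tokens & table_tokens)

def single_hop(question, tables, k=2):
    """Baseline: rank all tables once against the full question."""
    q = tokens(question)
    ranked = sorted(tables, key=lambda name: -score(q, tokens(tables[name])))
    return ranked[:k]

def retrieve_with_removal(question, tables, max_hops=3):
    """Multi-hop retrieval where each hop removes what the retrieved table covers."""
    remaining = tokens(question)
    retrieved = []
    for _ in range(max_hops):
        candidates = [
            (score(remaining, tokens(desc)), name)
            for name, desc in tables.items()
            if name not in retrieved
        ]
        if not candidates:
            break
        best_score, best = max(candidates)
        if best_score == 0:
            break  # nothing left in the question points at an unretrieved table
        retrieved.append(best)
        # Removal step: drop the question words the retrieved table already explains.
        remaining -= tokens(tables[best])
    return retrieved

# Hypothetical schema descriptions, invented for this example.
TABLES = {
    "singer": "singer name country age",
    "singer_award": "singer name country award",
    "concert": "concert stadium year",
}
QUESTION = "What are the name, country and age of each singer and the stadium of their concert?"

print(single_hop(QUESTION, TABLES, k=2))
print(retrieve_with_removal(QUESTION, TABLES))
```

In this toy setting, the single-hop baseline spends its second slot on `singer_award`, which resembles the first hit, while the removal variant moves on to `concert` once the singer-related words have been deleted from the question.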
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Text-to-SQL | Spider (test) | -- | 213 |
| Text-to-SQL | Spider | Exec Acc (All): 70.5 | 139 |
| Text-to-SQL | BIRD (test) | -- | 46 |
| Text-to-SQL | Spider 2.0 (test) | Execution Accuracy (Spider 2.0 Test): 7.4 | 27 |
| Text-to-SQL | Bird | Execution Accuracy (Llama-8B): 25.3 | 21 |
| Text-to-SQL | Spider 2.0 | Llama-8B Execution Accuracy: 0.9 | 21 |
| Table Retrieval | Spider Lite 2.0 (test) | Precision: 28.1 | 20 |
| Table Retrieval | Spider union (test) | Precision (P): 40.5 | 20 |
| Table Retrieval | BIRD union (test) | Precision: 44.6 | 20 |