| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| BIRD (dev) | CHASE-SQL | Execution Accuracy (EA) | 74.46 | 217 | 3d ago |
| Spider (test) | XiYan-SQL | Execution Accuracy | 89.65 | 140 | 3d ago |
| Spider (dev) | MCS-SQL + GPT-4 | EX (All) | 89.5 | 100 | 3d ago |
| Spider 1.0 (dev) | MAC-SQL | Exact Match Accuracy | 86.75 | 92 | 3d ago |
| Spider 1.0 (test) | CHESS+GPT-4 | EM Acc (Overall) | 87.2 | 91 | 3d ago |
| LogicCat | IESR | Exact Match | 24.28 | 58 | 3d ago |
| Spider | SQL-O1 | Exec Acc (All) | 86.54 | 57 | 3d ago |
| Archer (dev) | IESR | Execution Accuracy | 37.28 | 36 | 3d ago |
| Spider-Realistic | DCG-SQL | Execution Accuracy (EX) | 81.9 | 33 | 3d ago |
| BIRD (test) | XiYan-SQL | EX | 75.63 | 32 | 3d ago |
| Spider-Syn | DCG-SQL | Execution Accuracy (EX) | 78.7 | 26 | 3d ago |
| Spider-DK | Think2SQL-14B | Execution Accuracy (EX) | 77.8 | 26 | 3d ago |
| EHRSQL | DataFlow-Text2SQL-90K | Gre Score | 56.1 | 22 | 3d ago |
| BIRD | Qwen-Plus | Total Execution Accuracy | 68.32 | 22 | 3d ago |
| Geoquery | LIR+RIR | Exact Match Accuracy | 83 | 17 | 3d ago |
| KaggleDBQA (test) | MNL | EA (%) | 64 | 14 | 3d ago |
| NL2SQL | JsonTuning | Execution Accuracy | 53.2 | 14 | 3d ago |
| BIRD (test dev) | MAC-SQL + GPT-4 | Execution Accuracy (EX) | 48.92 | 14 | 3d ago |
| ATIS | LIR+RIR | Exact Match Accuracy | 47.8 | 13 | 3d ago |
| Ambrosia | HEROSQL | AUPRC | 64.18 | 12 | 3d ago |
| NL2SQL (test) | RASD(REST) | SR | 4.24 | 12 | 3d ago |
| WikiSQL Fully-supervised (test) | Guo & Gao w. GRAPPA (MLM+SSP) | Execution Accuracy | 90.8 | 12 | 3d ago |
| WikiSQL Fully-supervised (dev) | Guo & Gao w. GRAPPA (MLM) | Execution Accuracy | 91.4 | 12 | 3d ago |
| GEOQUERY template (test) | LIRd+RIR | Accuracy | 0.83 | 12 | 3d ago |
| ATIS template (test) | LIRd+RIR | Accuracy | 47.8 | 12 | 3d ago |