| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| BIRD (dev) | CHASE-SQL | Execution Accuracy (EA)74.46 | 251 | 11d ago | |
| Spider (test) | XiYan-SQL | Execution Accuracy89.65 | 162 | 11d ago | |
| Spider (dev) | MCS-SQL + GPT-4 | EX (All)89.5 | 100 | 9d ago | |
| Spider 1.0 (dev) | MAC-SQL | Exact Match Accuracy86.75 | 92 | 1mo ago | |
| Spider | IQuest-Coder-V1-40B-Instruct | Exec Acc (All)92.2 | 91 | 24d ago | |
| Spider 1.0 (test) | CHESS+GPT-4 | EM Acc (Overall)87.2 | 91 | 1mo ago | |
| BIRD | IQuest-Coder-V1-40B-Instruct | Total Execution Accuracy70.5 | 64 | 3d ago | |
| LogicCat | IESR | Exact Match24.28 | 58 | 1mo ago | |
| BIRD (test) | XiYan-SQL | EX75.63 | 46 | 11d ago | |
| EHRSQL | SkillTrojan | Execution Accuracy85.2 | 37 | 9d ago | |
| Archer (dev) | IESR | Execution Accuracy37.28 | 36 | 1mo ago | |
| Text-to-SQL Multi-sharded | RPOTD+replay | Functional Accuracy60.7 | 35 | 8d ago | |
| Spider-Realistic | DCG-SQL | Execution Accuracy (EX)81.9 | 33 | 1mo ago | |
| Spider-Syn | MTIR-SQL-4B | Gre78.6 | 32 | 1mo ago | |
| Spider-DK | SQL-TRAIL | Gre Score76.8 | 32 | 1mo ago | |
| BIRD | Agentic SQL + CSMR + ATR | Accuracy69.1 | 27 | 24d ago | |
| BiomedSQL (test) | Execution Accuracy (EX)90 | 24 | 1mo ago | ||
| Spider Lite 2.0 | SquRL-7B | Execution Accuracy (EX)49.18 | 24 | 1mo ago | |
| Spider | Vanilla SD | AVGLEN1.07 | 22 | 1mo ago | |
| EHRSQL | DataFlow-Text2SQL-90K | Gre Score56.1 | 22 | 1mo ago | |
| BIRD-SQL Mini (dev) | MultiGA (Ensemble GA) | Average Accuracy70.5 | 17 | 15d ago | |
| Geoquery | LIR+RIR | Exact Match Accuracy83 | 17 | 1mo ago | |
| KaggleDBQA (test) | MNL | EA (%)64 | 14 | 1mo ago | |
| NL2SQL | JsonTuning | Execution Accuracy53.2 | 14 | 1mo ago | |
| BIRD (test dev) | MAC-SQL + GPT-4 | Execution Accuracy (EX)48.92 | 14 | 1mo ago |