Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BIRD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-SQLBIRD (dev)
Execution Accuracy (EA)74.46
217
Text2SQLBIRD (dev)
Exec Acc (Greedy)65.3
37
Text-to-SQLBIRD (test)
EX75.63
32
Image ClassificationBird
Accuracy85.1
29
SQL Semantic ValidationBIRD
AUPRC80.36
24
Text-to-SQLBIRD
Total Execution Accuracy68.32
22
SQL execution performanceBIRD n=1534
EM (1T)64.9
21
SQL GenerationBIRD Original (dev)
Execution Accuracy (Simple)65.51
14
SQL GenerationBIRD Verified
Execution Accuracy (Simple)69.41
14
Text-to-SQLBIRD (test dev)
Execution Accuracy (EX)48.92
14
Table SelectionBIRD 2023 (test)
Avg #tables5.3
12
Text-to-SQLBIRD (holdout test)
Execution Accuracy73
11
Image Super-ResolutionBird
PSNR25.2998
7
End-to-end Question AnsweringBird
Accuracy20.6
6
Object DetectionBird
Accuracy94.9
5
DB RoutingBIRD Route
R@179.62
5
Coding AgentBird
Pass@143.83
5
RetrievalBird
Precision42.7
3
ClassificationBIRD
Accuracy72
3
Text-to-SQLBIRD official (test)
Total Accuracy73.67
2
Gram matrix computationBird
Metric-
0
Showing 21 of 21 rows