
BIRD

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
--- | --- | --- | ---
Text-to-SQL | BIRD (dev) | Execution Accuracy (EA) 77.84 | 387
Text-to-SQL | BIRD | Total Execution Accuracy 97.59 | 68
Text-to-SQL | BIRD | Execution Accuracy (EX) 65.12 | 63
Text-to-SQL | BIRD (test) | EX 75.63 | 46
Text2SQL | BIRD (dev) | Exec Acc (Greedy) 65.8 | 44
Text-to-SQL | BIRD (Non-Synthesized Matched Set) | ExM Accuracy 93.15 | 32
Text-to-SQL | BIRD Synthesized Matched Set | ExM Accuracy 90.39 | 32
Image Classification | Bird | Accuracy 85.1 | 29
Color Video Completion | Bird color video sequence | PSNR 25.382 | 28
Text-to-SQL | BIRD | Accuracy 69.1 | 27
SQL Semantic Validation | BIRD | AUPRC 80.36 | 24
Text-to-SQL | BIRD | Execution Accuracy (Llama-8B) 31.8 | 21
SQL execution performance | BIRD n=1534 | EM (1T) 64.9 | 21
Table Retrieval | BIRD union (test) | Precision 54.4 | 20
Text-to-SQL | BIRD | Execution Accuracy 73 | 20
Schema linking | BIRD (dev) | SRR 100 | 16
Text-to-SQL | Bird | Match Accuracy (MAT) 6.67 | 15
Schema retrieval | BIRD | Recall (R) 100 | 15
SQL Generation | BIRD Original (dev) | Execution Accuracy (Simple) 65.51 | 14
SQL Generation | BIRD Verified | Execution Accuracy (Simple) 69.41 | 14
Text-to-SQL | BIRD (test dev) | Execution Accuracy (EX) 48.92 | 14
Text-to-SQL | BIRD | Kendall's τ -0.11 | 12
Table Selection | BIRD 2023 (test) | Avg #tables 5.3 | 12
Table Retrieval | BIRD | Precision (P) 57.3 | 11
SQL generation | Bird | Pass@1 60.6 | 11
Showing 25 of 62 rows