Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
NL-to-Scala on BIRD (Execution Accuracy)
Loading...
40.7
Execution Accuracy
Table-Specialist
8.772
17.061
25.35
33.639
Oct 16, 2024
Execution Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
Execution Accuracy
Table-Specialist
Base Model=GPT-4, Trai...
2024.10
40.7
Table-Specialist
Base Model=GPT-3.5, Tr...
2024.10
36
Table-Specialist
Base Model=Llama3.1-8B...
2024.10
22.8
GPT-4
Base Model=GPT-4, Trai...
2024.10
18.9
GPT-3.5
Base Model=GPT-3.5, Tr...
2024.10
18.8
Llama3.1-8B
Base Model=Llama3.1-8B...
2024.10
10
Feedback
Search any
task
Search any
task