Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
NL-to-Scala on WikiTQ
Loading...
47.6
Execution Accuracy
Table-Specialist
2.568
14.259
25.95
37.641
Oct 16, 2024
Execution Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
Execution Accuracy
Table-Specialist
Base Model=GPT-4, Trai...
2024.10
47.6
Table-Specialist
Base Model=GPT-3.5, Tr...
2024.10
42.6
GPT-4
Base Model=GPT-4, Trai...
2024.10
19.8
Table-Specialist
Base Model=Llama3.1-8B...
2024.10
18.8
GPT-3.5
Base Model=GPT-3.5, Tr...
2024.10
10.9
Llama3.1-8B
Base Model=Llama3.1-8B...
2024.10
4.3
Feedback
Search any
task
Search any
task