Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language-to-Code Generation on WikiTQ official (test)
Loading...
74.6
Execution Accuracy
Oracle
46.52
53.81
61.1
68.39
Feb 16, 2023
Execution Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Execution Accuracy
Oracle
generator_model=code-d...
2023.02
74.6
LEVER
generator_model=code-d...
2023.02
62.9
OmniTab
finetuning=true
2023.02
62.8
Codex Binder
finetuning=false
2023.02
61.9
TaCube
finetuning=true
2023.02
59.6
TaPEX*
finetuning=true
2023.02
57.5
Codex SQL
finetuning=false
2023.02
55.1
EP + Voting
generator_model=code-d...
2023.02
53.6
EP + ML
generator_model=code-d...
2023.02
52.5
Greedy
generator_model=code-d...
2023.02
50.9
ML
generator_model=code-d...
2023.02
50.9
Codex QA
finetuning=false
2023.02
47.6
Feedback
Search any
task
Search any
task