Language-to-Code Generation on WikiTQ official (test)
Best result: Oracle, 74.6 Execution Accuracy (Feb 16, 2023)
Evaluation Results
| Method | Settings | Links | Execution Accuracy |
|---|---|---|---|
| Oracle | generator_model=code-d... | 2023.02 | 74.6 |
| LEVER | generator_model=code-d... | 2023.02 | 62.9 |
| OmniTab | finetuning=true | 2023.02 | 62.8 |
| Codex Binder | finetuning=false | 2023.02 | 61.9 |
| TaCube | finetuning=true | 2023.02 | 59.6 |
| TaPEX* | finetuning=true | 2023.02 | 57.5 |
| Codex SQL | finetuning=false | 2023.02 | 55.1 |
| EP + Voting | generator_model=code-d... | 2023.02 | 53.6 |
| EP + ML | generator_model=code-d... | 2023.02 | 52.5 |
| Greedy | generator_model=code-d... | 2023.02 | 50.9 |
| ML | generator_model=code-d... | 2023.02 | 50.9 |
| Codex QA | finetuning=false | 2023.02 | 47.6 |