Share your thoughts, 1 month free Claude Pro on usSee more

Language-to-Code Generation on WikiTQ official (test)

74.6Execution Accuracy

Oracle

Updated 4mo ago

Evaluation Results

Method	Links
Oracle 2023.02		74.6
LEVER 2023.02		62.9
OmniTab 2023.02		62.8
Codex Binder 2023.02		61.9
TaCube 2023.02		59.6
TaPEX* 2023.02		57.5
Codex SQL 2023.02		55.1
EP + Voting 2023.02		53.6
EP + ML 2023.02		52.5
Greedy 2023.02		50.9
ML 2023.02		50.9
Codex QA 2023.02		47.6