Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Text-to-SQL on Mini (dev)
Loading...
63.8
Execution Accuracy (EX)
PV-SQL
48.824
52.712
56.6
60.488
Apr 19, 2026
Execution Accuracy (EX)
Validation Score (VES)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Execution Accuracy (EX)
Validation Score (VES)
PV-SQL
Base LLM=GPT-4o
2026.04
63.8
74.63
TA-SQL
Base LLM=GPT-4o
2026.04
58.4
64.05
MAC-SQL
Base LLM=GPT-4o
2026.04
57.8
64.65
E-SQL
Base LLM=GPT-4o
2026.04
57.4
64.23
TS-SQL
Base LLM=GPT-4o
2026.04
55
59.14
XiYan-SQL
Base LLM=GPT-4o
2026.04
52.2
56.31
DIN-SQL
Base LLM=GPT-4o
2026.04
51
54.79
DAIL-SQL
Base LLM=GPT-4o
2026.04
50
50.84
GPT-4o
2026.04
49.4
54.57
Feedback
Search any
task
Search any
task