Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Text-to-SQL on BIRD-SQL StreamBench
Loading...
53.5
Accuracy (Simple)
ACE
46.116
48.033
49.95
51.867
Oct 6, 2025
Accuracy (Simple)
Accuracy (Moderate)
Accuracy (Challenging)
Average Accuracy
Updated 19d ago
Evaluation Results
Method
Method
Links
Accuracy (Simple)
Accuracy (Moderate)
Accuracy (Challenging)
Average Accuracy
ACE
Base LLM=DeepSeek-V3.1...
2025.10
53.5
50.7
56.6
52.9
GEPA
Base LLM=DeepSeek-V3.1...
2025.10
51.6
51.9
57.2
52.2
Base LLM
Base LLM=DeepSeek-V3.1...
2025.10
46.4
48.2
55.1
47.8
Feedback
Search any
task
Search any
task