Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-to-SQL Parsing on Ambrosia Unambiguous (test)
Loading...
88.7
Recall
IntentRL
19.332
37.341
55.35
73.359
Nov 13, 2025
Recall
Precision
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall
Precision
IntentRL
Model=4B Instruct, Out...
2025.11
88.7
74.7
SFT
Model=4B Instruct, Out...
2025.11
78
73.4
SFT
Model=4B Instruct, Out...
2025.11
77.7
74.7
IntentPrompt
Model=235B MoE Instruc...
2025.11
74.1
57.8
IntentPrompt
Model=235B MoE Instruc...
2025.11
69.3
64.1
IntentPrompt
Model=235B MoE Thinkin...
2025.11
61.8
48.4
IntentPrompt
Model=4B Instruct, Out...
2025.11
58.3
57.2
IntentPrompt
Model=4B Thinking, Out...
2025.11
58.3
56.1
IntentPrompt
Model=235B MoE Thinkin...
2025.11
53.7
35.9
IntentPrompt
Model=4B Thinking, Out...
2025.11
25.2
19.7
IntentPrompt
Model=4B Instruct, Out...
2025.11
22
18.8
Feedback
Search any
task
Search any
task