Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Semantic Parsing on NL2Bash
Loading...
45.19
Accuracy
CLG
22.0292
28.0421
34.055
40.0679
Jun 5, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CLG
Model=Qwen2.5-72B, Sho...
2025.06
45.19
BGE-KMeans
Model=Qwen2.5-72B, Sho...
2025.06
40.86
BM25-Major
Model=Qwen2.5-72B, Sho...
2025.06
38.11
Latent-Bayesian
Model=Qwen2.5-72B, Sho...
2025.06
37.36
Best-of-N
Model=Qwen2.5-72B, Sho...
2025.06
37.13
Random
Model=Qwen2.5-72B, Sho...
2025.06
33.28
Best-of-N
Model=Llama3-70B, Shot...
2025.06
29.76
Random
Model=Llama3-70B, Shot...
2025.06
29.72
CLG
Model=Llama3-70B, Shot...
2025.06
29.59
Latent-Bayesian
Model=Llama3-70B, Shot...
2025.06
29.2
BGE-KMeans
Model=Llama3-70B, Shot...
2025.06
28.39
BM25-Major
Model=Llama3-70B, Shot...
2025.06
27.92
EPR-KMeans
Model=Qwen2.5-72B, Sho...
2025.06
26.53
EPR-KMeans
Model=Llama3-70B, Shot...
2025.06
22.92
Feedback
Search any
task
Search any
task