Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on GeoQuery
Loading...
84.64
Accuracy
BGE-KMeans
25.2144
40.6422
56.07
71.4978
Jun 5, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
BGE-KMeans
Model=Llama3-70B, Shot...
2025.06
84.64
CLG
Model=Llama3-70B, Shot...
2025.06
84.64
Best-of-N
Model=Llama3-70B, Shot...
2025.06
80.71
EPR-KMeans
Model=Llama3-70B, Shot...
2025.06
78.21
Random
Model=Llama3-70B, Shot...
2025.06
73.36
EPR-KMeans
Model=Qwen2.5-72B, Sho...
2025.06
62.86
CLG
Model=Qwen2.5-72B, Sho...
2025.06
62.5
Latent-Bayesian
Model=Llama3-70B, Shot...
2025.06
62.14
BGE-KMeans
Model=Qwen2.5-72B, Sho...
2025.06
61.79
Best-of-N
Model=Qwen2.5-72B, Sho...
2025.06
61.07
Random
Model=Qwen2.5-72B, Sho...
2025.06
57.29
BM25-Major
Model=Qwen2.5-72B, Sho...
2025.06
52.5
BM25-Major
Model=Llama3-70B, Shot...
2025.06
43.21
Latent-Bayesian
Model=Qwen2.5-72B, Sho...
2025.06
27.5
Feedback
Search any
task
Search any
task