Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Evaluation on HALLUBENCH
Loading...
71.1
Accuracy
GeoFocus
56.956
60.628
64.3
67.972
Feb 9, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GeoFocus
Model Scale=7B
2026.02
71.1
GRPO
Model Scale=7B
2026.02
68.7
Baseline
Model Scale=7B
2026.02
68
GeoFocus
Model Scale=3B
2026.02
64.7
GRPO
Model Scale=3B
2026.02
63.3
Baseline
Model Scale=3B
2026.02
60.5
SFT
Model Scale=3B
2026.02
57.8
SFT
Model Scale=7B
2026.02
57.5
Feedback
Search any
task
Search any
task