Reasoning accuracy on PRONTOQA 5-hop
Best Accuracy: 81 (AVI)
[Chart: Accuracy over time; data point dated May 17, 2025]
Updated 4d ago
Evaluation Results
Method           Model    Date     Accuracy
AVI              Qwen 8B  2025.05  81
AVI              Qwen 4B  2025.05  77
No Correction    Qwen 4B  2025.05  59
No Correction    Qwen 8B  2025.05  52
Self Correction  Qwen 4B  2025.05  48
Self Correction  Qwen 8B  2025.05  46