Reasoning accuracy on PRONTOQA 3-hop
[Chart: Accuracy by date. Top entry: AVI, Accuracy 87, May 17, 2025.]
Evaluation Results
Method            Model     Date      Accuracy
AVI               Qwen 8B   2025.05   87
AVI               Qwen 4B   2025.05   68
No Correction     Qwen 4B   2025.05   60
No Correction     Qwen 8B   2025.05   54
Self Correction   Qwen 4B   2025.05   54
Self Correction   Qwen 8B   2025.05   54