Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reasoning accuracy on PRONTOQA 5-hop
Loading...
97.8
Accuracy
LogicAgent
43.928
57.914
71.9
85.886
May 17, 2025
Jun 8, 2025
Jul 1, 2025
Jul 23, 2025
Aug 15, 2025
Sep 6, 2025
Sep 29, 2025
Accuracy
Updated 11d ago
Evaluation Results
Method
Method
Links
Accuracy
LogicAgent
Type=AR+SR, Base Model...
2025.09
97.8
SymbCoT
Type=SR, Base Model=Qw...
2025.09
95.2
Aristotle
Type=AR+SR, Base Model...
2025.09
94.8
CoT
Type=LR, Base Model=Qw...
2025.09
92.4
Logic-LM
Type=SR, Base Model=Qw...
2025.09
91.89
ToT
Type=AR, Base Model=Qw...
2025.09
82.5
Naive
Type=LR, Base Model=Qw...
2025.09
82
AVI
Model=Qwen 8B
2025.05
81
CR
Type=AR, Base Model=Qw...
2025.09
80.2
AVI
Model=Qwen 4B
2025.05
77
No Correction
Model=Qwen 4B
2025.05
59
No Correction
Model=Qwen 8B
2025.05
52
Self Correction
Model=Qwen 4B
2025.05
48
Self Correction
Model=Qwen 8B
2025.05
46
Feedback
Search any
task
Search any
task