Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Arithmetic Reasoning on Game of 24 95 (test)
Loading...
100
Success Rate
MGRS
0.576
26.388
52.2
78.012
Nov 28, 2025
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
MGRS
LLM invoked=7.3
2025.11
100
FoT(n=4)
LLM invoked=20.6
2025.11
93.7
XoT
LLM invoked=1.8
2025.11
85.4
BoT
LLM invoked=3.0
2025.11
82.4
ToT
LLM invoked=13.7
2025.11
74
AoT
LLM invoked=4.0
2025.11
11.6
GoT(k=1)
LLM invoked=7.0
2025.11
5.3
CoT
LLM invoked=1.0
2025.11
4.4
CoT-SC
LLM invoked=10.0
2025.11
4.4
Feedback
Search any
task
Search any
task