Arithmetic Reasoning on Game of 24 95 (test)
[Chart: Success Rate over time. Top result: MGRS, 100 (Nov 28, 2025). Updated 1mo ago.]
Evaluation Results
Method      LLM invoked   Date      Success Rate
MGRS        7.3           2025.11   100
FoT(n=4)    20.6          2025.11   93.7
XoT         1.8           2025.11   85.4
BoT         3.0           2025.11   82.4
ToT         13.7          2025.11   74
AoT         4.0           2025.11   11.6
GoT(k=1)    7.0           2025.11   5.3
CoT         1.0           2025.11   4.4
CoT-SC      10.0          2025.11   4.4