Agentic Reasoning on Sokoban (test)
Best result: RETROAGENT, Success Rate 38.3 (Mar 9, 2026)
[Chart: Success Rate over time]
Evaluation Results

Method                Evaluation Protocol  Date     Success Rate
RETROAGENT            RL...                2026.03  38.3
RETROAGENT            RL...                2026.03  32.6
GiGPO                 Fi...                2026.03  21.9
LAMER                 Fi...                2026.03  14.3
GRPO w/ EMPG          Fi...                2026.03  12.8
GRPO                  Fi...                2026.03  11.2
RLOO                  Fi...                2026.03  9.9
EvolveR               Fi...                2026.03  6
Reflexion             Pr...                2026.03  4.3
MemRL                 Fi...                2026.03  4.2
ReAct                 Pr...                2026.03  3.9
Qwen-2.5-7B-Instruct  Ze...                2026.03  2.6