Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Planning on Sokoban Grid
Loading...
63
Validity Rate
L-ICL
0.6
16.8
33
49.2
Jan 30, 2026
Validity Rate
Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Validity Rate
Success Rate
L-ICL
Number of training exa...
2026.01
63
49
L-ICL
Number of training exa...
2026.01
62
44
RAG-ICL
Context size (characte...
2026.01
25
10
ReAct
Inference feedback=Ora...
2026.01
21
13
L-ICL
Number of training exa...
2026.01
21
17
ReAct
Prompting strategy=ReA...
2026.01
19
12
RAG-ICL
Context size (characte...
2026.01
17
4
Zero-Shot
Input representation (...
2026.01
15
0
Self-Refine
Reasoning samples (k)=...
2026.01
13
8
Self-Consistency
Reasoning samples (k)=...
2026.01
10
5
L-ICL
Number of training exa...
2026.01
10
8
ToT
Prompting strategy=ToT...
2026.01
3
2
Feedback
Search any
task
Search any
task