Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Planning on GraSIF (RobotHow)
Loading...
89
SR
ReAct
27.64
43.57
59.5
75.43
Dec 24, 2025
SR
APP
TPA
Updated 4d ago
Evaluation Results
Method
Method
Links
SR
APP
TPA
ReAct
2025.12
89
91
1,322
LookPlanGraph
2025.12
87
89
2,653
SayPlan
2025.12
86
87
5,576
SayPlan Lite
2025.12
84
89
4,641
LLM-as-P
2025.12
44
51
3,417
LLM+P
2025.12
30
38
5,396
Feedback
Search any
task
Search any
task