Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Planning on GraSIF BEHAVIOR-1K
Loading...
61
SR
SayPlan Lite
31.88
39.44
47
54.56
Dec 24, 2025
SR
APP
TPA
Updated 4d ago
Evaluation Results
Method
Method
Links
SR
APP
TPA
SayPlan Lite
2025.12
61
76
524
LookPlanGraph
2025.12
60
77
1,472
ReAct
2025.12
47
61
1,713
LLM-as-P
2025.12
39
53
178
SayPlan
2025.12
36
43
1,888
LLM+P
2025.12
33
37
160
Feedback
Search any
task
Search any
task