Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Robotic Planning on Comprehensive Scene 13
Loading...
100
SR
OR
92.72
94.61
96.5
98.39
Feb 9, 2026
SR
Average Steps
Average Time (s)
Average Tokens
Time to Success (Time-to-S)
Tokens per Success (Tokens-to-S)
Updated 4d ago
Evaluation Results
Method
Method
Links
SR
Average Steps
Average Time (s)
Average Tokens
Time to Success (Time-to-S)
Tokens per Success (Tokens-to-S)
OR
2026.02
100
8.7
20.03
21,900
20.03
21,900
ReAct
2026.02
99
12.9
109.5
173,000
110.6
57,000
MLDT
2026.02
98
8.3
22.54
20,800
23
21,600
Basic
2026.02
97
8.5
16.48
19,300
16.99
19,200
PlanORN
2026.02
94
13.2
51.4
91,700
54.68
92,700
ORN
2026.02
93
13.5
40.3
70,500
43.34
72,200
Feedback
Search any
task
Search any
task