Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Tidy-up-desk on Real-robot tabletop tasks 15 rollouts
Loading...
73.3
Success Rate (%)
SkiP
10.9
27.1
43.3
59.5
May 15, 2026
Success Rate (%)
Steps
Time (min:sec)
Updated 16d ago
Evaluation Results
Method
Method
Links
Success Rate (%)
Steps
Time (min:sec)
SkiP
Backbone=π0.5, Fine-tu...
2026.05
73.3
207.4
2
Base
Backbone=π0.5, Fine-tu...
2026.05
66.7
250.7
2
CoA
Backbone=π0.5, Fine-tu...
2026.05
40
232.4
2
KF-only
Backbone=π0.5, Fine-tu...
2026.05
13.3
286.8
2
Feedback
Search any
task
Search any
task