Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-horizon Manipulation on RLBench Simulated
Loading...
92.2
Success Rate (PutRubbish InBin)
SADP
77.224
81.112
85
88.888
May 16, 2026
Success Rate (PutRubbish InBin)
Success Rate (SlideBlock ToTarget)
Success Rate (MeatOff Grill)
Success Rate (Open Drawer)
Success Rate (Close Drawer)
Success Rate (PutItem InDrawer)
Average Success Rate
Updated 15d ago
Evaluation Results
Method
Method
Links
Success Rate (PutRubbish InBin)
Success Rate (SlideBlock ToTarget)
Success Rate (MeatOff Grill)
Success Rate (Open Drawer)
Success Rate (Close Drawer)
Success Rate (PutItem InDrawer)
Average Success Rate
SADP
2026.05
92.2
85.6
78.9
75
93.9
71.1
82.8
TARAD
2026.05
88.9
85
73.3
78.3
92.8
73.9
82
DP3
2026.05
77.8
66.1
51.1
55
74.4
46.1
61.8
Feedback
Search any
task
Search any
task