DROID

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Multi-step manipulation | DROID Tabletop Multi-step tasks | Success Rate: 98 | 18 |
| Semantic reasoning manipulation | DROID Tabletop Semantic tasks | Success Rate: 26 | 18 |
| Rearrangement with distractors | DROID Tabletop Distractor tasks | Success Rate: 27 | 18 |
| Pick-and-place | DROID Tabletop Simple tasks | Success Rate: 27 | 12 |
| Video Depth | DROID | Abs Rel: 0.223 | 8 |
| Robot Policy Learning | DROID Franka Panda | Average Success Rate: 47.4 | 7 |
| Video-to-Video Generation | Droid (test) | VBench: 0.81 | 6 |
| Interactive long-trajectory generation | DROID (val) | PSNR: 23.56 | 6 |
| Video Frame Rank-Correlation | DROID | VOC Rank-Correlation (Sparse): 0.99 | 6 |
| Autoregressive rollout | DROID External Camera (val) | SSIM: 86 | 5 |
| Camera Tracking | DROID-W | Error Rate (Downtown 1): 0.1 | 5 |
| Temporal Value Estimation | DROID (test) | VOC+: 93.67 | 5 |
| Segmentation | DROID internal held-out | Dice Coefficient: 76.7 | 5 |
| Monocular Depth Estimation | DROID (unseen domain) | Abs Rel: 0.237 | 4 |
| Dynamic Affordance Prediction | DROID 70/30 (test) | Open Microwave MAE: 37 | 4 |
| Video generation | DROID (Unseen Scene) | PSNR: 19.73 | 4 |
| Video generation | DROID Unseen Camera Viewpoint | PSNR: 20.87 | 4 |
| Video generation | DROID (In-Domain) | PSNR: 22.89 | 4 |
| Multi-view Video Generation | Droid 300 cases (test) | FID: 39.97 | 3 |
| Autoregressive rollout | DROID Wrist Camera (val) | SSIM: 67 | 2 |
| Articulation Estimation | DROID 19 articulated object manipulation demos | Prismatic Joint Angle Error (deg): 7.15 | 2 |