Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Autolabeling 3D Objects with Differentiable Rendering of SDF Shape Priors

About

We present an automatic annotation pipeline to recover 9D cuboids and 3D shapes from pre-trained off-the-shelf 2D detectors and sparse LIDAR data. Our autolabeling method solves an ill-posed inverse problem by considering learned shape priors and optimizing geometric and physical parameters. To address this challenging problem, we apply a novel differentiable shape renderer to signed distance fields (SDF), leveraged together with normalized object coordinate spaces (NOCS). Initially trained on synthetic data to predict shape and coordinates, our method uses these predictions for projective and geometric alignment over real samples. Moreover, we also propose a curriculum learning strategy, iteratively retraining on samples of increasing difficulty in subsequent self-improving annotation rounds. Our experiments on the KITTI3D dataset show that we can recover a substantial amount of accurate cuboids, and that these autolabels can be used to train 3D vehicle detectors with state-of-the-art results.

Sergey Zakharov, Wadim Kehl, Arjun Bhargava, Adrien Gaidon• 2019

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (val)--
62
3D Object DetectionKITTI car category (val)
AP BEV Easy50.51
21
Cuboid AutolabelingKITTI Easy
BEV Accuracy @ 0.580.7
18
Monocular 3D Object DetectionKITTI-360 (val)
AP BEV @0.3 (Easy)55.55
7
Monocular 3D Object DetectionKITTI-360 (test)
AP BEV @ IoU 0.3 (Easy)48.16
7
Cuboid AutolabelingKITTI Moderate
BEV@0.563.36
6
3D Object DetectionKITTI MV3D (val)
AP3D (IoU=0.5, Easy)94.8
4
Bird's Eye View (BEV) Object DetectionKITTI MV3D (val)
BEV AP @ 0.5 (Easy)95.1
4
3D Object Autolabeling QualityKITTI Moderate (train)
BEV @ 0.563.36
3
2D Object DetectionKITTI MV3D (val)
2D AP @ 0.5 (Easy)96.7
2
Showing 10 of 11 rows

Other info

Code

Follow for update