SIGNet: Semantic Instance Aided Unsupervised 3D Geometry Perception

About

Unsupervised learning for geometric perception (depth, optical flow, etc.) is of great interest to autonomous systems. Recent works on unsupervised learning have made considerable progress on perceiving geometry; however, they usually ignore the coherence of objects and perform poorly under scenarios with dark and noisy environments. In contrast, supervised learning algorithms, which are robust, require large labeled geometric dataset. This paper introduces SIGNet, a novel framework that provides robust geometry perception without requiring geometrically informative labels. Specifically, SIGNet integrates semantic information to make depth and flow predictions consistent with objects and robust to low lighting conditions. SIGNet is shown to improve upon the state-of-the-art unsupervised learning for depth prediction by 30% (in squared relative error). In particular, SIGNet improves the dynamic object class performance by 39% in depth prediction and 29% in flow prediction. Our code will be made available at https://github.com/mengyuest/SIGNet

Yue Meng, Yongxi Lu, Aman Raj, Samuel Sunarjo, Rui Guo, Tara Javidi, Gaurav Bansal, Dinesh Bharadia• 2018

Related benchmarks

Task	Dataset	Result
Monocular Depth Estimation	KITTI (Eigen)	Abs Rel0.133	523
Depth Estimation	KITTI (Eigen split)	RMSE5.181	291
Monocular Depth Estimation	KITTI 2015 (Eigen split)	Abs Rel0.133	95
Depth Prediction	KITTI original ground truth (test)	Abs Rel0.133	38
Optical Flow	KITTI non-occluded regions (Noc) 2015	End Point Error7.66	3
Optical Flow	KITTI 2015 (overall regions (All))	End Point Error13.91	3
Rigid Flow Prediction	KITTI non-occluded regions 2015 (train val)	EPE22.3819	2
Rigid Flow Prediction	KITTI overall regions 2015 (train/val)	End Point Error26.8465	2

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord