3D Common Corruptions and Data Augmentation

About

We introduce a set of image transformations that can be used as corruptions to evaluate the robustness of models as well as data augmentation mechanisms for training neural networks. The primary distinction of the proposed transformations is that, unlike existing approaches such as Common Corruptions, they incorporate the geometry of the scene, which leads to corruptions that are more likely to occur in the real world. We also introduce a set of semantic corruptions (e.g. natural object occlusions). We show these transformations are 'efficient' (can be computed on-the-fly), 'extendable' (can be applied on most image datasets), expose vulnerability of existing models, and can effectively make models more robust when employed as '3D data augmentation' mechanisms. The evaluations on several tasks and datasets suggest incorporating 3D information into benchmarking and training opens up a promising direction for robustness research.
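
To make the idea of a geometry-aware corruption concrete, here is a minimal sketch of a depth-dependent fog effect. It is not the authors' implementation: it assumes a per-pixel depth map is available and uses the standard atmospheric scattering model, and the function name `apply_depth_fog` and the parameters `beta` and `airlight` are hypothetical names chosen for illustration.

```python
import numpy as np

def apply_depth_fog(image, depth, beta=0.5, airlight=0.8):
    """Blend an image toward a uniform fog color using per-pixel depth.

    Illustrative sketch only (names and defaults are assumptions).
    image: float array of shape (H, W, 3) with values in [0, 1]
    depth: float array of shape (H, W) with non-negative distances
    beta:  fog density; larger values mean thicker fog
    airlight: fog brightness in [0, 1]
    """
    image = np.asarray(image, dtype=np.float64)
    depth = np.asarray(depth, dtype=np.float64)
    if image.shape[:2] != depth.shape:
        raise ValueError("image and depth must share height and width")
    # Transmission decays exponentially with distance from the camera.
    transmission = np.exp(-beta * depth)[..., None]
    # Scattering model: attenuated scene radiance plus scattered airlight.
    fogged = image * transmission + airlight * (1.0 - transmission)
    return np.clip(fogged, 0.0, 1.0)
```

Under this model, pixels close to the camera stay nearly unchanged while distant pixels fade toward the fog color, which is the kind of behavior a depth-agnostic 2D overlay cannot reproduce. Because it is a single vectorized pass over the image, a transformation of this form could plausibly be applied on the fly during training.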

Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir • 2022

Related benchmarks

Task	Dataset	Result
Surface Normal Prediction	NYU V2	Mean Error 17.2	137
Surface Normal Estimation	NYU V2	Mean Angular Error 17.2	96
Surface Normal Estimation	iBIMS-1	MAE 18.2	67
Video Surface Normal Estimation	Sintel	Mean Angular Error 40.5	32
Surface Normal Estimation	DIODE (test)	L1 Error 22.5	24
Surface Normal Estimation	ScanNet Indoor	Mean Error 16.2	18
Surface Normal Estimation	ScanNet Normal Benchmark (test)	Angle Error Threshold (11.25°) 60.2	18
Surface Normal Estimation	Sintel Outdoor	Accuracy (11.25° Threshold) 14.7	14
Transparent object normal estimation	TransNormal Synthetic (test)	Mean Angular Error 8.2	13
Transparent object normal estimation	ClearGrasp Synthetic (test)	Mean Angular Error 33.8	13
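
For readers unfamiliar with the metrics in the table, the following sketch shows how a mean angular error and an 11.25° threshold accuracy are commonly computed for surface normals. It assumes predictions and ground truth are arrays of 3D vectors; the function names are hypothetical, and each benchmark's exact protocol (for example, masking of invalid pixels) may differ.

```python
import numpy as np

def angular_errors_deg(pred, target, eps=1e-8):
    """Per-vector angle in degrees between predicted and true normals.

    pred, target: arrays of shape (..., 3); vectors need not be unit length.
    """
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    # Normalize both sets of vectors, guarding against zero length.
    pred_unit = pred / np.maximum(np.linalg.norm(pred, axis=-1, keepdims=True), eps)
    target_unit = target / np.maximum(np.linalg.norm(target, axis=-1, keepdims=True), eps)
    # Clip the cosine so rounding error cannot push it outside arccos's domain.
    cos = np.clip(np.sum(pred_unit * target_unit, axis=-1), -1.0, 1.0)
    return np.degrees(np.arccos(cos))

def mean_angular_error_deg(pred, target):
    """Average angular error in degrees (lower is better)."""
    return float(angular_errors_deg(pred, target).mean())

def accuracy_within_deg(pred, target, threshold_deg=11.25):
    """Percentage of vectors whose error is below the threshold (higher is better)."""
    errors = angular_errors_deg(pred, target)
    return float((errors < threshold_deg).mean() * 100.0)
```

For example, if one predicted normal matches the ground truth exactly and another is perpendicular to it, the mean angular error is 45° and the accuracy at the 11.25° threshold is 50%.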
