Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

3D Common Corruptions and Data Augmentation

About

We introduce a set of image transformations that can be used as corruptions to evaluate the robustness of models as well as data augmentation mechanisms for training neural networks. The primary distinction of the proposed transformations is that, unlike existing approaches such as Common Corruptions, the geometry of the scene is incorporated in the transformations -- thus leading to corruptions that are more likely to occur in the real world. We also introduce a set of semantic corruptions (e.g. natural object occlusions). We show these transformations are `efficient' (can be computed on-the-fly), `extendable' (can be applied on most image datasets), expose vulnerability of existing models, and can effectively make models more robust when employed as `3D data augmentation' mechanisms. The evaluations on several tasks and datasets suggest incorporating 3D information into benchmarking and training opens up a promising direction for robustness research.

O\u{g}uzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir• 2022

Related benchmarks

TaskDatasetResultRank
Surface Normal PredictionNYU V2
Mean Error17.2
100
Surface Normal EstimationDIODE (test)
L1 Error22.5
24
Surface Normal EstimationScanNet Normal Benchmark (test)
Angle Error Threshold (11.25°)60.2
18
Transparent object normal estimationTransNormal Synthetic (test)
Mean Angular Error8.2
13
Transparent object normal estimationClearGrasp Synthetic (test)
Mean Angular Error33.8
13
Transparent object normal estimationClearPose Real-World (test)
Mean Angular Error51.7
13
Video Surface Normal EstimationSintel
Mean Angular Error40.5
12
Surface Normal EstimationTaskonomy 2DCC (test)
L1 Error5.29
7
Surface Normal EstimationTaskonomy 3DCC (test)
L1 Error5.35
7
Surface Normal EstimationTaskonomy AE (test)
L1 Error4.94
7
Showing 10 of 14 rows

Other info

Code

Follow for update