Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multiview Compressive Coding for 3D Reconstruction

About

A central goal of visual recognition is to understand objects and scenes from a single image. 2D recognition has witnessed tremendous progress thanks to large-scale learning and general-purpose representations. Comparatively, 3D poses new challenges stemming from occlusions not depicted in the image. Prior works try to overcome these by inferring from multiple views or rely on scarce CAD models and category-specific priors which hinder scaling to novel settings. In this work, we explore single-view 3D reconstruction by learning generalizable representations inspired by advances in self-supervised learning. We introduce a simple framework that operates on 3D points of single objects or whole scenes coupled with category-agnostic large-scale training from diverse RGB-D videos. Our model, Multiview Compressive Coding (MCC), learns to compress the input appearance and geometry to predict the 3D structure by querying a 3D-aware decoder. MCC's generality and efficiency allow it to learn from large-scale and diverse data sources with strong generalization to novel objects imagined by DALL$\cdot$E 2 or captured in-the-wild with an iPhone.

Chao-Yuan Wu, Justin Johnson, Jitendra Malik, Christoph Feichtenhofer, Georgia Gkioxari• 2023

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionOmniObject3D
CD0.551
17
3D Asset ReconstructionToys4k
CD0.3299
11
3D Shape ReconstructionPix3D
FS@10.1754
10
Shape ReconstructionCO3D seen categories v2
Abs Error1.46
9
Shape ReconstructionCO3D unseen categories v2
Abs Error1.17
9
3D Shape ReconstructionOcrtoc3D (test)
FS@10.1994
7
3D Object ReconstructionCO3D 10 held-out categories v2
Accuracy17.2
6
Scene-level 3D ReconstructionGibson 1-view (test)
Visible Quality56.4
4
Scene-level 3D ReconstructionGibson 3-views (test)
Visible Score62.02
4
Scene-level 3D ReconstructionGibson 5-views (test)
Visible Score64.44
4
Showing 10 of 12 rows

Other info

Code

Follow for update