Next-Best View Policy for 3D Reconstruction
About
Manually selecting viewpoints or using commonly available flight planners like circular path for large-scale 3D reconstruction using drones often results in incomplete 3D models. Recent works have relied on hand-engineered heuristics such as information gain to select the Next-Best Views. In this work, we present a learning-based algorithm called Scan-RL to learn a Next-Best View (NBV) Policy. To train and evaluate the agent, we created Houses3K, a dataset of 3D house models. Our experiments show that using Scan-RL, the agent can scan houses with fewer number of steps and a shorter distance compared to our baseline circular path. Experimental results also demonstrate that a single NBV policy can be used to scan multiple houses including those that were not seen during training. The link to Scan-RL is available at https://github.com/darylperalta/ScanRL and Houses3K dataset can be found at https://github.com/darylperalta/Houses3K.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Next-Best-View 3D Reconstruction | Three datasets (Objaverse, Houses3K, and others) | Mean CR80 | 12 | |
| Point Cloud Reconstruction | OmniObject3D | Coverage Ratio (CR)80 | 7 | |
| Point Cloud Reconstruction | Objaverse | Coverage Rate (CR)81 | 7 | |
| Point Cloud Reconstruction | Houses3K | Coverage Rate (CR)81 | 7 | |
| Point Cloud Reconstruction | OmniObject3D, Objaverse, Houses3K Overall | Coverage Ratio (CR)81 | 7 |