Generative and Discriminative Voxel Modeling with Convolutional Neural Networks
About
When working with three-dimensional data, choice of representation is key. We explore voxel-based models, and present evidence for the viability of voxellated representations in applications including shape modeling and object classification. Our key contributions are methods for training voxel-based variational autoencoders, a user interface for exploring the latent space learned by the autoencoder, and a deep convolutional neural network architecture for object classification. We address challenges unique to voxel-based representations, and empirically evaluate our models on the ModelNet benchmark, where we demonstrate a 51.5% relative improvement in the state of the art for object classification.
Andrew Brock, Theodore Lim, J.M. Ritchie, Nick Weston• 2016
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Point Cloud Classification | ModelNet40 (test) | -- | 297 | |
| 3D Shape Classification | ModelNet40 (test) | Accuracy95.5 | 227 | |
| Object Classification | ModelNet40 (test) | Accuracy95.54 | 180 | |
| Classification | ModelNet40 (test) | Accuracy91.33 | 99 | |
| 3D shape recognition | ModelNet10 (test) | Accuracy97.1 | 64 | |
| 3D Object Classification | ModelNet10 (test) | Mean Class Accuracy97.14 | 57 | |
| Object Classification | ModelNet10 (test) | Accuracy97.14 | 46 | |
| 3D Shape Classification | ModelNet-40 | Accuracy91.33 | 41 | |
| 3D shape recognition | ModelNet10 | Accuracy93.61 | 23 | |
| Shape classification | ModelNet40 1.0 (test) | OA95.5 | 15 |
Showing 10 of 11 rows