MeshWalker: Deep Mesh Understanding by Random Walks
About
Most attempts to represent 3D shapes for deep learning have focused on volumetric grids, multi-view images and point clouds. In this paper we look at the most popular representation of 3D shapes in computer graphics - a triangular mesh - and ask how it can be utilized within deep learning. The few attempts to answer this question propose to adapt convolutions & pooling to suit Convolutional Neural Networks (CNNs). This paper proposes a very different approach, termed MeshWalker, to learn the shape directly from a given mesh. The key idea is to represent the mesh by random walks along the surface, which "explore" the mesh's geometry and topology. Each walk is organized as a list of vertices, which in some manner imposes regularity on the mesh. The walk is fed into a Recurrent Neural Network (RNN) that "remembers" the history of the walk. We show that our approach achieves state-of-the-art results for two fundamental shape analysis tasks: shape classification and semantic segmentation. Furthermore, even a very small number of examples suffices for learning. This is highly important, since large datasets of meshes are difficult to acquire.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Shape Classification | ModelNet40 (test) | Accuracy92.3 | 227 | |
| 3D Shape Classification | ModelNet-40 | Accuracy92.3 | 41 | |
| Classification | ModelNet40 | Accuracy92.3 | 26 | |
| Mesh Segmentation | Human Body dataset | Accuracy94.8 | 20 | |
| Human part segmentation | SHREC07 Human Body (test) | Accuracy92.7 | 11 | |
| Mesh Segmentation | COSEG (test) | Vases98.7 | 9 | |
| Semantic segmentation | Maron original meshes (test) | Face Accuracy92.7 | 9 | |
| 3D Shape Classification | Cube Engraving (test) | Accuracy98.6 | 9 | |
| Classification | SHREC 11 (test) | Accuracy97.1 | 9 | |
| Classification | SHREC11 | Accuracy97.1 | 9 |