Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Monte Carlo Convolution for Learning on Non-Uniformly Sampled Point Clouds

About

Deep learning systems extensively use convolution operations to process input data. Though convolution is clearly defined for structured data such as 2D images or 3D volumes, this is not true for other data types such as sparse point clouds. Previous techniques have developed approximations to convolutions for restricted conditions. Unfortunately, their applicability is limited and cannot be used for general point clouds. We propose an efficient and effective method to learn convolutions for non-uniformly sampled point clouds, as they are obtained with modern acquisition techniques. Learning is enabled by four key novelties: first, representing the convolution kernel itself as a multilayer perceptron; second, phrasing convolution as a Monte Carlo integration problem, third, using this notion to combine information from multiple samplings at different levels; and fourth using Poisson disk sampling as a scalable means of hierarchical point cloud learning. The key idea across all these contributions is to guarantee adequate consideration of the underlying non-uniform sample distribution function from a Monte Carlo perspective. To make the proposed concepts applicable to real-world tasks, we furthermore propose an efficient implementation which significantly reduces the GPU memory required during the training process. By employing our method in hierarchical network architectures we can outperform most of the state-of-the-art networks on established point cloud segmentation, classification and normal estimation benchmarks. Furthermore, in contrast to most existing approaches, we also demonstrate the robustness of our method with respect to sampling variations, even when training with uniformly sampled data only. To support the direct application of these concepts, we provide a ready-to-use TensorFlow implementation of these layers at https://github.com/viscom-ulm/MCCNN

Pedro Hermosilla, Tobias Ritschel, Pere-Pau V\'azquez, \`Alvar Vinacua, Timo Ropinski• 2018

Related benchmarks

TaskDatasetResultRank
3D Shape ClassificationModelNet40 (test)
Accuracy90.9
227
3D Object Part SegmentationShapeNet Part (test)
mIoU85.9
114
3D Semantic SegmentationScanNet v2 (test)
mIoU63.3
110
Semantic segmentationScanNet (test)
mIoU63.3
59
Normal estimationModelNet40
Error0.16
12
Showing 5 of 5 rows

Other info

Follow for update