Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

About

3D semantic occupancy prediction is an important task for robust vision-centric autonomous driving, which predicts fine-grained geometry and semantics of the surrounding scene. Most existing methods leverage dense grid-based scene representations, overlooking the spatial sparsity of the driving scenes. Although 3D semantic Gaussian serves as an object-centric sparse alternative, most of the Gaussians still describe the empty region with low efficiency. To address this, we propose a probabilistic Gaussian superposition model which interprets each Gaussian as a probability distribution of its neighborhood being occupied and conforms to probabilistic multiplication to derive the overall geometry. Furthermore, we adopt the exact Gaussian mixture model for semantics calculation to avoid unnecessary overlapping of Gaussians. To effectively initialize Gaussians in non-empty region, we design a distribution-based initialization module which learns the pixel-aligned occupancy distribution instead of the depth of surfaces. We conduct extensive experiments on nuScenes and KITTI-360 datasets and our GaussianFormer-2 achieves state-of-the-art performance with high efficiency. Code: https://github.com/huang-yh/GaussianFormer.

Yuanhui Huang, Amonnut Thammatadatrakoon, Wenzhao Zheng, Yunpeng Zhang, Dalong Du, Jiwen Lu• 2024

Related benchmarks

TaskDatasetResultRank
3D Occupancy PredictionOcc3D-nuScenes (val)
mIoU13.4
144
3D Semantic Occupancy PredictionSurroundOcc (val)
mIoU20.8
36
3D Semantic Occupancy PredictionSurroundOcc-nuScenes (val)
IoU31.74
31
3D Semantic Occupancy PredictionSurroundOcc-nuScenes rainy scenario (val)
mIoU20.36
26
3D Semantic Occupancy PredictionSurroundOcc-nuScenes night scenario (val)
mIoU (Mean IoU)10.14
22
3D Semantic Occupancy PredictionnuScenes 1.0 (val)
IoU31.74
21
3D Semantic Occupancy PredictionSSCBench-KITTI-360 (test)
Overall IoU38.37
15
3D Semantic Occupancy PredictionSurroundOcc v1.0 (test)
IoU30.56
15
3D Occupancy PredictionNuScenes SurroundOcc v1.0 (test)
mIoU20.82
12
Showing 9 of 9 rows

Other info

Code

Follow for update