Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow

About

Occupancy estimation has become a prominent task in 3D computer vision, particularly within the autonomous driving community. In this paper, we present a novel approach to occupancy estimation, termed GaussianFlowOcc, which is inspired by Gaussian Splatting and replaces traditional dense voxel grids with a sparse 3D Gaussian representation. Our efficient model architecture based on a Gaussian Transformer significantly reduces computational and memory requirements by eliminating the need for expensive 3D convolutions used with inefficient voxel-based representations that predominantly represent empty 3D spaces. GaussianFlowOcc effectively captures scene dynamics by estimating temporal flow for each Gaussian during the overall network training process, offering a straightforward solution to a complex problem that is often neglected by existing methods. Moreover, GaussianFlowOcc is designed for scalability, as it employs weak supervision and does not require costly dense 3D voxel annotations based on additional data (e.g., LiDAR). Through extensive experimentation, we demonstrate that GaussianFlowOcc significantly outperforms all previous methods for weakly supervised occupancy estimation on the nuScenes dataset while featuring an inference speed that is 50 times faster than current SOTA.

Simon Boeder, Fabian Gigengack, Benjamin Risse• 2025

Related benchmarks

TaskDatasetResultRank
3D Occupancy PredictionOcc3D-nuScenes (val)
mIoU17.1
213
3D Semantic Occupancy PredictionOcc3D
RayIoU16.2
40
3D Semantic Occupancy PredictionOcc3D-nuScenes v1.0 (val)
mIoU46.9
26
Occupancy EstimationOpenOcc v2
RayIoU17.1
12
Semantic Occupancy EstimationOcc3D-nuScenes
mIoU16.7
9
Showing 5 of 5 rows

Other info

Follow for update