Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TFusionOcc: Student's t-Distribution Based Object-Centric Multi-Sensor Fusion Framework for 3D Occupancy Prediction

About

3D semantic occupancy prediction enables autonomous vehicles (AVs) to perceive fine-grained geometric and semantic structure of their surroundings from onboard sensors, which is essential for safe decision-making and navigation. Recent models for 3D semantic occupancy prediction have successfully addressed the challenge of describing real-world objects with varied shapes and classes. However, the intermediate representations used by existing methods for 3D semantic occupancy prediction rely heavily on 3D voxel volumes or a set of 3D Gaussians, hindering the model's ability to efficiently and effectively capture fine-grained geometric details in the 3D driving environment. This paper introduces TFusionOcc, a novel object-centric multi-sensor fusion framework for predicting 3D semantic occupancy. By leveraging multi-stage multi-sensor fusion, Student's t-distribution, and the T-Mixture model (TMM), together with more geometrically flexible primitives, such as the deformable superquadric (superquadric with inverse warp), the proposed method achieved state-of-the-art (SOTA) performance on the nuScenes benchmark. In addition, extensive experiments were conducted on the nuScenes-C dataset to demonstrate the robustness of the proposed method in different camera and lidar corruption scenarios. The code will be available at: https://github.com/DanielMing123/TFusionOcc

Zhenxing Ming, Julie Stephany Berrio, Mao Shan, Stewart Worrall• 2026

Related benchmarks

TaskDatasetResultRank
3D Occupancy PredictionOcc3D-nuScenes (val)
mIoU5.34e+3
144
3D Semantic Occupancy PredictionSurroundOcc-nuScenes (val)
IoU47.01
31
3D Semantic Occupancy PredictionSurroundOcc-nuScenes rainy scenario (val)
mIoU30.82
26
3D Semantic Occupancy PredictionSurroundOcc-nuScenes night scenario (val)
mIoU (Mean IoU)19.67
22
3D Semantic Occupancy PredictionnuScenes-C Camera Corruption v1.0 (val)
Clean Score47.01
12
3D Semantic Occupancy PredictionnuScenes (val)
Latency (ms)278.5
11
3D Semantic Occupancy PredictionnuScenes-C Lidar Corruption
mIoU (Clean)27.5
10
Showing 7 of 7 rows

Other info

Follow for update