Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OccAny: Generalized Unconstrained Urban 3D Occupancy

About

Relying on in-domain annotations and precise sensor-rig priors, existing 3D occupancy prediction methods are limited in both scalability and out-of-domain generalization. While recent visual geometry foundation models exhibit strong generalization capabilities, they were mainly designed for general purposes and lack one or more key ingredients required for urban occupancy prediction, namely metric prediction, geometry completion in cluttered scenes and adaptation to urban scenarios. We address this gap and present OccAny, the first unconstrained urban 3D occupancy model capable of operating on out-of-domain uncalibrated scenes to predict and complete metric occupancy coupled with segmentation features. OccAny is versatile and can predict occupancy from sequential, monocular, or surround-view images. Our contributions are three-fold: (i) we propose the first generalized 3D occupancy framework with (ii) Segmentation Forcing that improves occupancy quality while enabling mask-level prediction, and (iii) a Novel View Rendering pipeline that infers novel-view geometry to enable test-time view augmentation for geometry completion. Extensive experiments demonstrate that OccAny outperforms all visual geometry baselines on 3D occupancy prediction task, while remaining competitive with in-domain self-supervised methods across three input settings on two established urban occupancy prediction datasets. Our code is available at https://github.com/valeoai/OccAny .

Anh-Quan Cao, Tuan-Hung Vu• 2026

Related benchmarks

TaskDatasetResultRank
3D Occupancy PredictionOcc3D-NuScenes surround-view
IoU34.15
12
3D Occupancy PredictionOcc3D-NuScenes (out-of-domain)
Precision59.58
9
3D Occupancy PredictionSemanticKITTI Sequence setting
Precision43.38
9
3D Semantic Occupancy PredictionSemanticKITTI sequence
mIoU7.28
9
Occupancy PredictionSemanticKITTI (out-of-domain)
Precision45.64
9
3D Occupancy PredictionOcc3D-NuScenes Sequence setting
Precision48.09
9
Occupancy PredictionSemanticKITTI in-domain--
6
3D Occupancy PredictionSemanticKITTI monocular
Precision45.64
5
3D Occupancy PredictionOcc3D-NuScenes (in-domain)--
5
Ego Vehicle Trajectory PredictionnuScenes (val)
ADE1.86
3
Showing 10 of 10 rows

Other info

Follow for update