
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion

About

Instance segmentation in 3D is a challenging task due to the lack of large-scale annotated datasets. In this paper, we show that this task can be addressed effectively by leveraging instead 2D pre-trained models for instance segmentation. We propose a novel approach to lift 2D segments to 3D and fuse them by means of a neural field representation, which encourages multi-view consistency across frames. The core of our approach is a slow-fast clustering objective function, which is scalable and well-suited for scenes with a large number of objects. Unlike previous approaches, our method does not require an upper bound on the number of objects or object tracking across frames. To demonstrate the scalability of the slow-fast clustering, we create a new semi-realistic dataset called the Messy Rooms dataset, which features scenes with up to 500 objects per scene. Our approach outperforms the state-of-the-art on challenging scenes from the ScanNet, Hypersim, and Replica datasets, as well as on our newly created Messy Rooms dataset, demonstrating the effectiveness and scalability of our slow-fast clustering method.
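The abstract names a slow-fast clustering objective but does not spell it out. As a rough illustration only, the sketch below shows one common momentum-style reading of "slow-fast": a fast embedding set is trained directly, a slow copy follows it through an exponential moving average, and pixels that share a 2D segment within one view are pulled together. The function names, the squared-distance similarity, the `temperature` and `momentum` values, and the use of plain arrays in place of a neural field are all assumptions made for this example; it is not the paper's exact loss.

```python
import numpy as np


def ema_update(slow, fast, momentum=0.9):
    """Move the slow parameters a small step toward the fast ones (assumed EMA rule)."""
    return momentum * slow + (1.0 - momentum) * fast


def slow_fast_contrastive_loss(fast_emb, slow_emb, seg_ids, temperature=0.1):
    """Hypothetical contrastive loss between fast and slow embeddings of one view.

    fast_emb, slow_emb: (N, D) embeddings of the same N sampled pixels, N >= 2.
    seg_ids: (N,) segment labels from a 2D model, meaningful only within this view.
    """
    # Negative squared distance between each fast embedding and each slow embedding.
    diff = fast_emb[:, None, :] - slow_emb[None, :, :]
    logits = -np.sum(diff ** 2, axis=-1) / temperature

    # Positives share a 2D segment with the anchor; a pixel is never its own positive.
    not_self = ~np.eye(len(seg_ids), dtype=bool)
    positives = (seg_ids[:, None] == seg_ids[None, :]) & not_self

    # Log-softmax over all other pixels, computed stably.
    logits = np.where(not_self, logits, -np.inf)
    row_max = logits.max(axis=1, keepdims=True)
    log_norm = row_max + np.log(np.exp(logits - row_max).sum(axis=1, keepdims=True))
    log_prob = logits - log_norm

    # Average negative log-probability of the positives, skipping anchors without any.
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0
    pos_log_prob = np.where(positives, log_prob, 0.0).sum(axis=1)
    return float(np.mean(-pos_log_prob[valid] / pos_count[valid]))


# Toy check: embeddings grouped by segment should score better than mixed ones.
seg = np.array([0, 0, 0, 1, 1, 1])
centers = np.array([[0.0, 0.0], [5.0, 5.0]])
grouped = centers[seg]
mixed = centers[np.array([0, 1, 0, 1, 0, 1])]
loss_grouped = slow_fast_contrastive_loss(grouped, grouped, seg)
loss_mixed = slow_fast_contrastive_loss(mixed, mixed, seg)
```

Because the labels are only compared within a single view, a loss of this shape needs neither a fixed number of object slots nor cross-frame tracking, which is consistent with the properties the abstract claims.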

Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Panoptic Segmentation | ScanNet V2 | Panoptic Quality (PQ): 37.35 | 14 |
| Panoptic Segmentation | ScanNet++ | PQ (Panoptic Quality): 47.58 | 14 |
| 3D Instance Segmentation | ScanNet | PQscene: 62.3 | 7 |
| 3D Instance Segmentation | Replica3D | PQ Scene: 59.1 | 7 |
| 3D Panoptic Segmentation | Messy-Rooms | PQ^scene (Old Room, 25 Obj): 78.9 | 5 |
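For readers unfamiliar with the metric in the results above, the following is a minimal sketch of the standard single-class Panoptic Quality formula, PQ = (sum of IoUs over true positives) / (TP + 0.5 FP + 0.5 FN), where a predicted and a ground-truth segment count as a true positive when their IoU exceeds 0.5. The function name and the input layout are assumptions for this example, and the scene-level variants listed in the table are not reproduced here. The listed scores appear to be reported as percentages, whereas this function returns a value in [0, 1].

```python
def panoptic_quality(candidate_ious, num_pred, num_gt):
    """Standard PQ for one class.

    candidate_ious: IoU of each predicted segment with its best ground-truth
        segment (hypothetical input layout; one value per candidate pair).
    num_pred, num_gt: total number of predicted and ground-truth segments.
    """
    # A pair is a true positive only if its IoU is strictly above 0.5.
    tp_ious = [iou for iou in candidate_ious if iou > 0.5]
    tp = len(tp_ious)
    fp = num_pred - tp  # predictions left unmatched
    fn = num_gt - tp    # ground-truth segments left unmatched
    denom = tp + 0.5 * fp + 0.5 * fn
    return sum(tp_ious) / denom if denom > 0 else 0.0


# Three predictions, four ground-truth segments, two matches above the threshold.
pq = panoptic_quality([0.9, 0.8, 0.4], num_pred=3, num_gt=4)
```

In this toy case TP = 2, FP = 1 and FN = 2, so the denominator is 3.5 and PQ is 1.7 / 3.5.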
