Find Any Part in 3D

About

Why don't we have foundation models in 3D yet? A key limitation is data scarcity. For 3D object part segmentation, existing datasets are small in size and lack diversity. We show that it is possible to break this data barrier by building a data engine powered by 2D foundation models. Our data engine automatically annotates any number of object parts: 1755x more unique part types than existing datasets combined. By training on our annotated data with a simple contrastive objective, we obtain an open-world model that generalizes to any part in any object based on any text query. Even when evaluated zero-shot, we outperform existing methods on the datasets they train on. We achieve 260% improvement in mIoU and boost speed by 6x to 300x. Our scaling analysis confirms that this generalization stems from the data scale, which underscores the impact of our data engine. Finally, to advance general-category open-world 3D part segmentation, we release a benchmark covering a wide range of objects and parts. Project website: https://ziqi-ma.github.io/find3dsite/

Ziqi Ma, Yisong Yue, Georgia Gkioxari • 2024

Related benchmarks

Task               Dataset                      Metric  Result  Rank
Part Segmentation  PartNet (test)               mIoU    35.4    19
Part Segmentation  PartNetE                     mIoU    16.4    9
Part Segmentation  Faust                        mIoU    63.2    6
Part Segmentation  ShapeNetPart 51 (test)       mIoU    23.3    6
Part Segmentation  ScanObjectNN                 mIoU    18.8    4
Part Segmentation  3DCoMPaT (test)              mIoU    23.9    4
Part Segmentation  Find3D (test)                mIoU    37.9    4
Part Segmentation  Objaverse General (Seen)     mIoU    28.9    2
Part Segmentation  Objaverse General (Unseen)   mIoU    34.6    2
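
All of the results listed here are reported as mIoU (mean intersection over union). As a rough illustration of what that number measures, here is a minimal sketch for a single object with per-point part labels. The function name `mean_iou` and the choice to skip parts that appear in neither the prediction nor the ground truth are assumptions made for this example; individual benchmarks may average differently (for instance per category or per instance), so this is not the exact evaluation protocol behind the numbers.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_parts: int) -> float:
    """Mean IoU over part labels for one object (hypothetical helper).

    pred and target hold one integer part label per point.
    Parts that occur in neither sequence are skipped (an assumption).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must label the same points")

    ious = []
    for part in range(num_parts):
        # Points labeled `part` in both prediction and ground truth.
        intersection = sum(1 for p, t in zip(pred, target) if p == part and t == part)
        # Points labeled `part` in either prediction or ground truth.
        union = sum(1 for p, t in zip(pred, target) if p == part or t == part)
        if union > 0:
            ious.append(intersection / union)

    # With no parts present at all, report 0.0 rather than dividing by zero.
    return sum(ious) / len(ious) if ious else 0.0


# Six points, three parts; one point of part 2 is mislabeled as part 1.
predicted = [0, 0, 1, 1, 2, 2]
ground_truth = [0, 0, 1, 2, 2, 2]
score = mean_iou(predicted, ground_truth, num_parts=3)
```

In this toy case the per-part IoUs are 1, 1/2, and 2/3, so `score` is their mean, 13/18 (about 0.72, or 72.2 on the 0 to 100 scale used in the listing).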