Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation

About

3D instance segmentation aims to predict a set of object instances in a scene and represent them as binary foreground masks with corresponding semantic labels. Currently, transformer-based methods are gaining increasing attention due to their elegant pipelines, reduced manual selection of geometric properties, and superior performance. However, transformer-based methods fail to simultaneously maintain strong position and content information during query initialization. Additionally, due to supervision at each decoder layer, there exists a phenomenon of object disappearance with the deepening of layers. To overcome these hurdles, we introduce Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation (BFL). Specifically, an Agent-Interpolation Initialization Module is designed to generate resilient queries capable of achieving a balance between foreground coverage and content learning. Additionally, a Hierarchical Query Fusion Decoder is designed to retain low overlap queries, mitigating the decrease in recall with the deepening of layers. Extensive experiments on ScanNetV2, ScanNet200, ScanNet++ and S3DIS datasets demonstrate the superior performance of BFL.

Jiahao Lu, Jiacheng Deng, Tianzhu Zhang• 2025

Related benchmarks

TaskDatasetResultRank
3D Object DetectionScanNet V2 (val)--
352
3D Instance SegmentationScanNet V2 (val)
Average AP5079.5
195
3D Instance SegmentationScanNet v2 (test)
mAP60.6
135
3D Instance SegmentationS3DIS (Area 5)
mAP@50% IoU71.9
106
Instance SegmentationScanNetV2 (val)--
58
3D Instance SegmentationScanNet++ V1 (val)
AP5035.2
12
3D Instance SegmentationScanNet200 v2 (val)
mAP (%)30.5
10
3D Instance SegmentationScanNet++ V1 (test)
mAP22.2
7
Semantic 3D instance segmentationScanNet++ (val)
AP25.3
6
3D Instance SegmentationScanNet++ (test)
mAP22.2
5
Showing 10 of 10 rows

Other info

Follow for update