Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mask Transfiner for High-Quality Instance Segmentation

About

Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation. Instead of operating on regular dense tensors, our Mask Transfiner decomposes and represents the image regions as a quadtree. Our transformer-based approach only processes detected error-prone tree nodes and self-corrects their errors in parallel. While these sparse pixels only constitute a small proportion of the total number, they are critical to the final mask quality. This allows Mask Transfiner to predict highly accurate instance masks, at a low computational cost. Extensive experiments demonstrate that Mask Transfiner outperforms current instance segmentation methods on three popular benchmarks, significantly improving both two-stage and query-based frameworks by a large margin of +3.0 mask AP on COCO and BDD100K, and +6.6 boundary AP on Cityscapes. Our code and trained models will be available at http://vis.xyz/pub/transfiner.

Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu• 2021

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO (test-dev)
mAP46.5
1195
Instance SegmentationCOCO (val)
APmk45.4
472
Instance SegmentationCOCO (test-dev)
APM44.8
380
Instance SegmentationCOCO 2017 (test-dev)
AP (Overall)41.6
253
Instance SegmentationCityscapes (val)
AP49.8
239
Instance SegmentationDSIS
mAP67.5
23
Instance SegmentationCOME15K E
mAP48.7
23
Instance SegmentationSIP
mAP57.8
23
Instance SegmentationCOME15K-H
mAP40.7
23
Instance SegmentationBDD100K (val)
APmask23.6
5
Showing 10 of 10 rows

Other info

Code

Follow for update