Mask Transfiner for High-Quality Instance Segmentation
About
Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation. Instead of operating on regular dense tensors, our Mask Transfiner decomposes and represents the image regions as a quadtree. Our transformer-based approach only processes detected error-prone tree nodes and self-corrects their errors in parallel. While these sparse pixels only constitute a small proportion of the total number, they are critical to the final mask quality. This allows Mask Transfiner to predict highly accurate instance masks, at a low computational cost. Extensive experiments demonstrate that Mask Transfiner outperforms current instance segmentation methods on three popular benchmarks, significantly improving both two-stage and query-based frameworks by a large margin of +3.0 mask AP on COCO and BDD100K, and +6.6 boundary AP on Cityscapes. Our code and trained models will be available at http://vis.xyz/pub/transfiner.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO (test-dev) | mAP46.5 | 1195 | |
| Instance Segmentation | COCO (val) | APmk45.4 | 472 | |
| Instance Segmentation | COCO (test-dev) | APM44.8 | 380 | |
| Instance Segmentation | COCO 2017 (test-dev) | AP (Overall)41.6 | 253 | |
| Instance Segmentation | Cityscapes (val) | AP49.8 | 239 | |
| Instance Segmentation | DSIS | mAP67.5 | 23 | |
| Instance Segmentation | COME15K E | mAP48.7 | 23 | |
| Instance Segmentation | SIP | mAP57.8 | 23 | |
| Instance Segmentation | COME15K-H | mAP40.7 | 23 | |
| Instance Segmentation | BDD100K (val) | APmask23.6 | 5 |