RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features
About
The two-stage methods for instance segmentation, e.g. Mask R-CNN, have achieved excellent performance recently. However, the segmented masks are still very coarse due to the downsampling operations in both the feature pyramid and the instance-wise pooling process, especially for large objects. In this work, we propose a new method called RefineMask for high-quality instance segmentation of objects and scenes, which incorporates fine-grained features during the instance-wise segmenting process in a multi-stage manner. Through fusing more detailed information stage by stage, RefineMask is able to refine high-quality masks consistently. RefineMask succeeds in segmenting hard cases such as bent parts of objects that are over-smoothed by most previous methods and outputs accurate boundaries. Without bells and whistles, RefineMask yields significant gains of 2.6, 3.4, 3.8 AP over Mask R-CNN on COCO, LVIS, and Cityscapes benchmarks respectively at a small amount of additional computational cost. Furthermore, our single-model result outperforms the winner of the LVIS Challenge 2020 by 1.3 points on the LVIS test-dev set and establishes a new state-of-the-art. Code will be available at https://github.com/zhanggang001/RefineMask.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO (test-dev) | mAP43.8 | 1195 | |
| Instance Segmentation | COCO 2017 (val) | APm0.531 | 1144 | |
| Instance Segmentation | COCO (val) | APmk42.3 | 472 | |
| Instance Segmentation | COCO (test-dev) | APM42 | 380 | |
| Instance Segmentation | COCO 2017 (test-dev) | AP (Overall)39.4 | 253 | |
| Instance Segmentation | Cityscapes (val) | AP49.2 | 239 | |
| Instance Segmentation | LVIS v1.0 (val) | AP (Rare)14.2 | 189 | |
| Instance Segmentation | LVIS v1.0 (test-dev) | AP42.5 | 4 |