TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation
About
Medical image segmentation - the prerequisite of numerous clinical needs - has been significantly prospered by recent advances in convolutional neural networks (CNNs). However, it exhibits general limitations on modeling explicit long-range relation, and existing cures, resorting to building deep encoders along with aggressive downsampling operations, leads to redundant deepened networks and loss of localized details. Hence, the segmentation task awaits a better solution to improve the efficiency of modeling global contexts while maintaining a strong grasp of low-level details. In this paper, we propose a novel parallel-in-branch architecture, TransFuse, to address this challenge. TransFuse combines Transformers and CNNs in a parallel style, where both global dependency and low-level spatial details can be efficiently captured in a much shallower manner. Besides, a novel fusion technique - BiFusion module is created to efficiently fuse the multi-level features from both branches. Extensive experiments demonstrate that TransFuse achieves the newest state-of-the-art results on both 2D and 3D medical image sets including polyp, skin lesion, hip, and prostate segmentation, with significant parameter decrease and inference speed improvement.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Polyp Segmentation | CVC-ClinicDB (test) | DSC94.2 | 196 | |
| Polyp Segmentation | Kvasir | Dice Score92 | 128 | |
| Medical Image Segmentation | BUSI (test) | Dice79.36 | 121 | |
| Polyp Segmentation | ETIS | Dice Score73.7 | 108 | |
| Skin Lesion Segmentation | ISIC 2017 (test) | Dice Score88.4 | 100 | |
| Medical Image Segmentation | ISIC 2018 | Dice Score89.28 | 92 | |
| Polyp Segmentation | ETIS (test) | Mean Dice73.7 | 86 | |
| Polyp Segmentation | ColonDB | mDice78.1 | 74 | |
| Skin Lesion Segmentation | ISIC 2018 (test) | Dice Score90.1 | 74 | |
| Polyp Segmentation | Kvasir (test) | Dice Coefficient92 | 73 |