Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation

About

In the past few years, convolutional neural networks (CNNs) have achieved milestones in medical image analysis. Especially, the deep neural networks based on U-shaped architecture and skip-connections have been widely applied in a variety of medical image tasks. However, although CNN has achieved excellent performance, it cannot learn global and long-range semantic information interaction well due to the locality of the convolution operation. In this paper, we propose Swin-Unet, which is an Unet-like pure Transformer for medical image segmentation. The tokenized image patches are fed into the Transformer-based U-shaped Encoder-Decoder architecture with skip-connections for local-global semantic feature learning. Specifically, we use hierarchical Swin Transformer with shifted windows as the encoder to extract context features. And a symmetric Swin Transformer-based decoder with patch expanding layer is designed to perform the up-sampling operation to restore the spatial resolution of the feature maps. Under the direct down-sampling and up-sampling of the inputs and outputs by 4x, experiments on multi-organ and cardiac segmentation tasks demonstrate that the pure Transformer-based U-shaped Encoder-Decoder network outperforms those methods with full-convolution or the combination of transformer and convolution. The codes and trained models will be publicly available at https://github.com/HuCaoFighting/Swin-Unet.

Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang• 2021

Related benchmarks

TaskDatasetResultRank
Cardiac SegmentationACDC (test)
Avg Dice90
141
Medical Image SegmentationACDC (test)
Avg DSC90.25
135
Medical Image SegmentationBUSI (test)
Dice77.38
121
Medical Image SegmentationSynapse (test)
Dice79.13
111
Skin Lesion SegmentationISIC 2017 (test)
Dice Score88.15
100
Medical Image SegmentationISIC 2018
Dice Score86.57
92
Multi-organ SegmentationSynapse multi-organ CT (test)
DSC79.13
81
Medical Image SegmentationMedical Image Segmentation Aggregate (Average of BUSI, BTMRI, ISIC, Kvasir-SEG, QaTa-COV19, and EUS) (test)
DSC65.03
80
Medical Image SegmentationKvasir-SEG (test)
mIoU77.56
78
Medical Image SegmentationKvasir-Seg
Dice Score80.67
75
Showing 10 of 182 rows
...

Other info

Code

Follow for update