Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation

About

Recent advancements in large-scale Vision Transformers have made significant strides in improving pre-trained models for medical image segmentation. However, these methods face a notable challenge in acquiring a substantial amount of pre-training data, particularly within the medical field. To address this limitation, we present Masked Multi-view with Swin Transformers (SwinMM), a novel multi-view pipeline for enabling accurate and data-efficient self-supervised medical image analysis. Our strategy harnesses the potential of multi-view information by incorporating two principal components. In the pre-training phase, we deploy a masked multi-view encoder devised to concurrently train masked multi-view observations through a range of diverse proxy tasks. These tasks span image reconstruction, rotation, contrastive learning, and a novel task that employs a mutual learning paradigm. This new task capitalizes on the consistency between predictions from various perspectives, enabling the extraction of hidden multi-view information from 3D medical data. In the fine-tuning stage, a cross-view decoder is developed to aggregate the multi-view information through a cross-attention block. Compared with the previous state-of-the-art self-supervised learning method Swin UNETR, SwinMM demonstrates a notable advantage on several medical image segmentation tasks. It allows for a smooth integration of multi-view information, significantly boosting both the accuracy and data-efficiency of the model. Code and models are available at https://github.com/UCSC-VLAA/SwinMM/.

Yiqing Wang, Zihan Li, Jieru Mei, Zihao Wei, Li Liu, Chen Wang, Shengtian Sang, Alan Yuille, Cihang Xie, Yuyin Zhou• 2023

Related benchmarks

TaskDatasetResultRank
Medical Image SegmentationMM-WHS (test)
Dice Score86.98
62
Multi-organ SegmentationBTCV (test)
Spl94.33
55
Liver SegmentationLiTS
Dice Score95.52
29
Organ SegmentationWORD
Overall DICE86.18
20
Medical Image SegmentationMSD Spleen (test)
Dice Score95.34
18
Brain Tumor SegmentationBraTS 21
Dice TC83.48
14
SegmentationAMOS
1-shot Score63.56
13
SegmentationBraTS 21
Performance (1-shot)57.12
13
SegmentationWORD
1-shot Acc74.65
13
SegmentationBTCV
1-shot Score72.28
13
Showing 10 of 15 rows

Other info

Code

Follow for update