VM-UNet: Vision Mamba UNet for Medical Image Segmentation
About
In the realm of medical image segmentation, both CNN-based and Transformer-based models have been extensively explored. However, CNNs exhibit limitations in long-range modeling capabilities, whereas Transformers are hampered by their quadratic computational complexity. Recently, State Space Models (SSMs), exemplified by Mamba, have emerged as a promising approach. They not only excel in modeling long-range interactions but also maintain a linear computational complexity. In this paper, leveraging state space models, we propose a U-shape architecture model for medical image segmentation, named Vision Mamba UNet (VM-UNet). Specifically, the Visual State Space (VSS) block is introduced as the foundation block to capture extensive contextual information, and an asymmetrical encoder-decoder structure is constructed with fewer convolution layers to save calculation cost. We conduct comprehensive experiments on the ISIC17, ISIC18, and Synapse datasets, and the results indicate that VM-UNet performs competitively in medical image segmentation tasks. To our best knowledge, this is the first medical image segmentation model constructed based on the pure SSM-based model. We aim to establish a baseline and provide valuable insights for the future development of more efficient and effective SSM-based segmentation systems. Our code is available at https://github.com/JCruan519/VM-UNet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Image Segmentation | BUSI (test) | Dice83.74 | 228 | |
| Medical Image Segmentation | ISIC 2018 | Dice Score89.2 | 187 | |
| Cardiac Segmentation | ACDC (test) | Avg Dice91.47 | 162 | |
| Skin Lesion Segmentation | ISIC 2018 (test) | Dice Score89.62 | 143 | |
| Skin Lesion Segmentation | ISIC 2017 (test) | Dice Score90.22 | 134 | |
| Medical Image Segmentation | BUSI | Dice Score61.35 | 134 | |
| Medical Image Segmentation | Synapse (test) | Dice81.08 | 123 | |
| Polyp Segmentation | ETIS | Dice Score70.56 | 122 | |
| Medical Image Segmentation | CVC-ClinicDB | Dice Score90.88 | 118 | |
| Polyp Segmentation | Kvasir-SEG (test) | mIoU0.84 | 116 |