Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues

About

Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.

Yuxin Xie, Tao Zhou, Yi Zhou, Geng Chen• 2024

Related benchmarks

TaskDatasetResultRank
Medical Image Segmentationcolon polyp
mIoU82.47
25
SegmentationBrain Tumor
mIoU72.38
22
SegmentationPolyp
mIoU82.47
16
Lesion Segmentationclinical breast ultrasound dataset
mDSC84.03
13
SegmentationMRI Brain Tumor
mDSC81.74
9
Showing 5 of 5 rows

Other info

Code

Follow for update