SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues

About

Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.

Yuxin Xie, Tao Zhou, Yi Zhou, Geng Chen• 2024

Related benchmarks

Task	Dataset	Result
Medical Image Segmentation	colon polyp	mIoU82.47	25
Segmentation	Brain Tumor	mIoU72.38	22
Segmentation	Polyp	mIoU82.47	16
Lesion Segmentation	clinical breast ultrasound dataset	mDSC84.03	13
Segmentation	MRI Brain Tumor	mDSC81.74	9

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord