Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video

About

In this paper, we present Endo-SemiS, a semi-supervised segmentation framework for providing reliable segmentation of endoscopic video frames with limited annotation. EndoSemiS uses 4 strategies to improve performance by effectively utilizing all available data, particularly unlabeled data: (1) Cross-supervision between two individual networks that supervise each other; (2) Uncertainty-guided pseudo-labels from unlabeled data, which are generated by selecting high-confidence regions to improve their quality; (3) Joint pseudolabel supervision, which aggregates reliable pixels from the pseudo-labels of both networks to provide accurate supervision for unlabeled data; and (4) Mutual learning, where both networks learn from each other at the feature and image levels, reducing variance and guiding them toward a consistent solution. Additionally, a separate corrective network that utilizes spatiotemporal information from endoscopy video to improve segmentation performance. Endo-SemiS is evaluated on two clinical applications: kidney stone laser lithotomy from ureteroscopy and polyp screening from colonoscopy. Compared to state-of-the-art segmentation methods, Endo-SemiS substantially achieves superior results on both datasets with limited labeled data. The code is publicly available at https://github.com/MedICL-VU/Endo-SemiS

Hao Li, Daiwei Lu, Xing Yao, Nicholas Kavoussi, Ipek Oguz• 2025

Related benchmarks

Task	Dataset	Result
Pixel-level Segmentation	Kidney (test)	Dice87.6	13
Image-level Target Presence Detection	Kidney (test)	Precision95	13
Polyp Segmentation	Polyp dataset single frame (10% labeled data)	Dice79	4
Polyp Segmentation	Polyp dataset sequence frame (10% labeled data)	Dice71	4

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord