Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A generalizable foundation model for intraoperative understanding across surgical procedures

About

In minimally invasive surgery, clinical decisions depend on real-time visual interpretation, yet intraoperative perception varies substantially across surgeons and procedures. This variability limits consistent assessment, training, and the development of reliable artificial intelligence systems, as most surgical AI models are designed for narrowly defined tasks and do not generalize across procedures or institutions. Here we introduce ZEN, a generalizable foundation model for intraoperative surgical video understanding trained on more than 4 million frames from over 21 procedures using a self-supervised multi-teacher distillation framework. We curated a large and diverse dataset and systematically evaluated multiple representation learning strategies within a unified benchmark. Across 20 downstream tasks and full fine-tuning, frozen-backbone, few-shot and zero-shot settings, ZEN consistently outperforms existing surgical foundation models and demonstrates robust cross-procedure generalization. These results suggest a step toward unified representations for surgical scene understanding and support future applications in intraoperative assistance and surgical training assessment.

Kanggil Park, Yongjun Jeon, Soyoung Lim, Seonmin Park, Jongmin Shin, Jung Yong Kim, Sehyeon An, Jinsoo Rhu, Jongman Kim, Gyu-Seong Choi, Namkee Oh, Kyu-Hwan Jung• 2026

Related benchmarks

TaskDatasetResultRank
Surgical Phase RecognitionCholec80--
35
Action Triplet RecognitionCholecT50
AP (I)86.11
27
Closed-ended Visual Question AnsweringPitVQA
F1 Score60.43
26
Closed-ended Visual Question AnsweringLLS48-VQA
F1 Score23.35
26
Depth EstimationHamlyn
Abs Rel0.1554
26
Instance SegmentationGrasp
mAP (Mask)0.5597
26
Object DetectionGrasp
mAP (BBox)62.5
26
Semantic segmentationCholecSeg8k
DSC0.8187
26
Semantic segmentationGrasp
DSC78.12
26
Skill AssessmentCholec80 CVS
mAP0.3856
26
Showing 10 of 20 rows

Other info

Follow for update