
Pixel-Wise Recognition for Holistic Surgical Scene Understanding

About

This paper presents the Holistic and Multi-Granular Surgical Scene Understanding of Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene understanding as a hierarchy of complementary tasks with varying levels of granularity. Our approach encompasses long-term tasks, such as surgical phase and step recognition, and short-term tasks, including surgical instrument segmentation and atomic visual action detection. To exploit our proposed benchmark, we introduce the Transformers for Actions, Phases, Steps, and Instrument Segmentation (TAPIS) model, a general architecture that combines a global video feature extractor with localized region proposals from an instrument segmentation model to tackle the multi-granularity of our benchmark. Through extensive experimentation on our benchmark and on alternative benchmarks, we demonstrate TAPIS's versatility and state-of-the-art performance across different tasks. This work represents a foundational step forward in Endoscopic Vision, offering a novel framework for future research towards holistic surgical scene understanding.
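The abstract describes TAPIS as pairing a global video feature with localized region proposals from an instrument segmentation model. The sketch below only illustrates that general idea under loud assumptions: features are plain lists of floats, fusion is simple concatenation, and each head is a single linear layer. The function names (`fuse_global_and_region`, `linear_head`, `predict_region_actions`, `predict_phase`) are hypothetical. None of this is taken from the authors' implementation, which is Transformer-based.

```python
from typing import List, Sequence

Vector = List[float]


def fuse_global_and_region(global_feat: Sequence[float],
                           region_feats: Sequence[Sequence[float]]) -> List[Vector]:
    """Pair one clip-level feature with every region-level feature.

    Assumption: fusion is plain concatenation; the real model may fuse differently.
    """
    return [list(global_feat) + list(region) for region in region_feats]


def linear_head(x: Sequence[float],
                weights: Sequence[Sequence[float]],
                bias: Sequence[float]) -> Vector:
    """Score each class as a dot product plus bias: scores[c] = w_c . x + b_c."""
    return [sum(w_i * x_i for w_i, x_i in zip(w, x)) + b
            for w, b in zip(weights, bias)]


def predict_region_actions(global_feat: Sequence[float],
                           region_feats: Sequence[Sequence[float]],
                           weights: Sequence[Sequence[float]],
                           bias: Sequence[float]) -> List[Vector]:
    """Short-term task: one score vector per instrument region."""
    fused = fuse_global_and_region(global_feat, region_feats)
    return [linear_head(f, weights, bias) for f in fused]


def predict_phase(global_feat: Sequence[float],
                  weights: Sequence[Sequence[float]],
                  bias: Sequence[float]) -> Vector:
    """Long-term task: scores computed from the global feature alone."""
    return linear_head(global_feat, weights, bias)
```

In this toy setup, the long-term tasks (phases, steps) read only the global feature, while the short-term tasks (per-instrument actions) read the global feature joined with each region's feature, so a single video encoder can serve both levels of granularity.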

Nicolás Ayobi, Santiago Rodríguez, Alejandra Pérez, Isabela Hernández, Nicolás Aparicio, Eugénie Dessevres, Sebastián Peña, Jessica Santander, Juan Ignacio Caicedo, Nicolás Fernández, Pablo Arbeláez • 2024

Related benchmarks

Task                              Dataset                   Result                   Rank
Phase Recognition                 GraSP (test)              mAP 76.72                10
Phase Recognition                 MISAW                     mAP 97.14                10
Instrument Semantic Segmentation  GraSP (cross-validation)  mIoU 0.8705              8
Surgical Phase Recognition        HeiChole                  F1 Score 0.7341          8
Gesture Recognition               RARP-45 (test)            mAP 57.25                6
Instrument Instance Segmentation  GraSP (cross-validation)  mAP@0.5 (Box) 92.65      6
Instrument Presence Recognition   GraSP                     mAP 94.33                6
Phase Recognition                 MISAW (test)              mAP 97.14                6
Step Recognition                  MISAW (test)              mAP 77.52                6
Atomic Action Detection           GraSP (test)              mAP@0.5 IoU (Box) 39.26  4
Showing 10 of 15 rows
