Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery

About

Understanding the intricate workflows of cataract surgery requires modeling complex interactions between surgical tools, anatomical structures, and procedural techniques. Existing datasets primarily address isolated aspects of surgical analysis, such as tool detection or phase segmentation, but lack comprehensive representations that capture the semantic relationships between entities over time. This paper introduces the Cataract Surgery Scene Graph (CAT-SG) dataset, the first to provide structured annotations of tool-tissue interactions, procedural variations, and temporal dependencies. By incorporating detailed semantic relations, CAT-SG offers a holistic view of surgical workflows, enabling more accurate recognition of surgical phases and techniques. Additionally, we present a novel scene graph generation model, CatSGG, which outperforms current methods in generating structured surgical representations. The CAT-SG dataset is designed to enhance AI-driven surgical training, real-time decision support, and workflow analysis, paving the way for more intelligent, context-aware systems in clinical practice.

Felix Holm, G\"ozde \"Unver, Ghazal Ghazaei, Nassir Navab• 2025

Related benchmarks

TaskDatasetResultRank
Surgical workflow recognitionCAT-SG
Accuracy0.7863
5
Scene Graph GenerationCAT-SG
Close To91.63
4
Showing 2 of 2 rows

Other info

Follow for update