EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing
About
To make medical datasets accessible without sharing sensitive patient information, we introduce a novel end-to-end approach for generative de-identification of dynamic medical imaging data. Until now, generative methods have faced constraints in terms of fidelity, spatio-temporal coherence, and the length of generation, failing to capture the complete details of dataset distributions. We present a model designed to produce high-fidelity, long and complete data samples with near-real-time efficiency and explore our approach on a challenging task: generating echocardiogram videos. We develop our generation method based on diffusion models and introduce a protocol for medical video dataset anonymization. As an exemplar, we present EchoNet-Synthetic, a fully synthetic, privacy-compliant echocardiogram dataset with paired ejection fraction labels. As part of our de-identification protocol, we evaluate the quality of the generated dataset and propose to use clinical downstream tasks as a measurement on top of widely used but potentially biased image quality metrics. Experimental outcomes demonstrate that EchoNet-Synthetic achieves comparable dataset fidelity to the actual dataset, effectively supporting the ejection fraction regression task. Code, weights and dataset are available at https://github.com/HReynaud/EchoNet-Synthetic.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Ejection Fraction Prediction | EchoNet-Dynamic (test) | R20.75 | 44 | |
| Video Generation | EchoNet-Dynamic (test) | FID17.4 | 13 | |
| LVEF regression | Pediatric A4C view (test) | R2 Score0.94 | 5 | |
| LVEF regression | Pediatric PSAX view (test) | R20.68 | 4 | |
| Ultrasound Video Generation | EchoNet-Dynamic LIDM Generated Hearts (test) | FID22.6 | 3 | |
| Ultrasound Video Generation | EchoNet-Dynamic Encoded Real Hearts (test) | FID17.4 | 2 | |
| LVEF regression | Synthetic EchoNet-Dynamic (test) | R20.93 | 1 | |
| LVEF regression | Synthetic Pediatric PSAX (test) | R20.96 | 1 | |
| Ultrasound Video Generation | EchoNet-Pediatric A4C (Encoded Real Hearts) (test) | FID24.8 | 1 | |
| Ultrasound Video Generation | EchoNet-Pediatric PSAX Encoded Real Hearts (test) | FID33 | 1 |