Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Masked Image Modelling for retinal OCT understanding

About

This work explores the effectiveness of masked image modelling for learning representations of retinal OCT images. To this end, we leverage Masked Autoencoders (MAE), a simple and scalable method for self-supervised learning, to obtain a powerful and general representation for OCT images by training on 700K OCT images from 41K patients collected under real world clinical settings. We also provide the first extensive evaluation for a model of OCT on a challenging battery of 6 downstream tasks. Our model achieves strong performance when fully finetuned but can also serve as a versatile frozen feature extractor for many tasks using lightweight adapters. Furthermore, we propose an extension of the MAE pretraining to fuse OCT with an auxiliary modality, namely, IR fundus images and learn a joint model for both. We demonstrate our approach improves performance on a multimodal downstream application. Our experiments utilize most publicly available OCT datasets, thus enabling future comparisons. Our code and model weights are publicly available https://github.com/TheoPis/MIM_OCT.

Theodoros Pissas, Pablo M\'arquez-Neila, Sebastian Wolf, Martin Zinkernagel, Raphael Sznitman• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationOCTDL
Accuracy (ACC)94.09
13
Large Vessel SegmentationOCTA-500 50% labels (train test)
Dice73.8
5
Large Vessel SegmentationOCTA-500 100% labels (train test)
Dice Score79.7
5
Image ClassificationOCTID Ophthalmology OCT
Accuracy95.71
4
Image ClassificationTVHL-DME Ophthalmology OCT
Accuracy92.71
4
Image ClassificationTVHL-DR Ophthalmology OCT
Accuracy84.68
4
Showing 6 of 6 rows

Other info

Follow for update