Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

About

Harnessing the power of pre-training on large-scale datasets like ImageNet forms a fundamental building block for the progress of representation learning-driven solutions in computer vision. Medical images are inherently different from natural images as they are acquired in the form of many modalities (CT, MR, PET, Ultrasound etc.) and contain granulated information like tissue, lesion, organs etc. These characteristics of medical images require special attention towards learning features representative of local context. In this work, we focus on designing an effective pre-training framework for 3D radiology images. First, we propose a new masking strategy called local masking where the masking is performed across channel embeddings instead of tokens to improve the learning of local feature representations. We combine this with classical low-level perturbations like adding noise and downsampling to further enable low-level representation learning. To this end, we introduce Disruptive Autoencoders, a pre-training framework that attempts to reconstruct the original image from disruptions created by a combination of local masking and low-level perturbations. Additionally, we also devise a cross-modal contrastive loss (CMCL) to accommodate the pre-training of multiple modalities in a single framework. We curate a large-scale dataset to enable pre-training of 3D medical radiology images (MRI and CT). The proposed pre-training framework is tested across multiple downstream tasks and achieves state-of-the-art performance. Notably, our proposed method tops the public test leaderboard of BTCV multi-organ segmentation challenge.

Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He, Vishwesh Nath• 2023

Related benchmarks

TaskDatasetResultRank
SegmentationISLES 2022 (test)
HD95 (IS)4.13
34
SegmentationBraTS MET 2023 (test)
HD95 (ET)37.63
34
Brain Tumor SegmentationBraTS PED 2023 (test)
HD95 (ET)19.33
34
SegmentationMRBrainS 2013 (test)
HD95 (CSF)3.07
17
Brain Structure SegmentationMRBrainS13
CF Score71.37
17
ClassificationADHD-200 11 (test)
Accuracy66.88
17
SegmentationUPENN-GBM (test)
HD95 (ET)2.24
17
ClassificationBraTS 2018 (test)
ACC77.19
17
Brain Metastasis SegmentationBraTS-MET
Dice ET62.27
17
Glioblastoma SegmentationUPENN-GBM
ET Segmentation Score86.9
17
Showing 10 of 14 rows

Other info

Follow for update