Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?

About

The pre-training and fine-tuning paradigm has become prominent in transfer learning. For example, if the model is pre-trained on ImageNet and then fine-tuned to PASCAL, it can significantly outperform that trained on PASCAL from scratch. While ImageNet pre-training has shown enormous success, it is formed in 2D, and the learned features are for classification tasks; when transferring to more diverse tasks, like 3D image segmentation, its performance is inevitably compromised due to the deviation from the original ImageNet context. A significant challenge lies in the lack of large, annotated 3D datasets rivaling the scale of ImageNet for model pre-training. To overcome this challenge, we make two contributions. Firstly, we construct AbdomenAtlas 1.1 that comprises 9,262 three-dimensional computed tomography (CT) volumes with high-quality, per-voxel annotations of 25 anatomical structures and pseudo annotations of seven tumor types. Secondly, we develop a suite of models that are pre-trained on our AbdomenAtlas 1.1 for transfer learning. Our preliminary analyses indicate that the model trained only with 21 CT volumes, 672 masks, and 40 GPU hours has a transfer learning ability similar to the model trained with 5,050 (unlabeled) CT volumes and 1,152 GPU hours. More importantly, the transfer learning ability of supervised models can further scale up with larger annotated datasets, achieving significantly better performance than preexisting pre-trained models, irrespective of their pre-training methodologies or data sources. We hope this study can facilitate collective efforts in constructing larger 3D medical datasets and more releases of supervised pre-trained models.

Wenxuan Li, Alan Yuille, Zongwei Zhou• 2025

Related benchmarks

TaskDatasetResultRank
SegmentationLiver tumour
DSC65.4
30
SegmentationLung tumour
DSC66.1
30
SegmentationGallbladder cancer
DSC21.7
15
Invasion ClassificationStructured Report Generation
Accuracy81
4
Survival PredictionLung cancer
AUC0.687
4
Survival PredictionRectal cancer
AUC71.4
4
Tumour DetectionNon-contrast CT Liver cancer
AUC0.855
4
Tumour DetectionNon-contrast CT Pancreatic cancer
AUC0.944
4
Tumour DetectionNon-contrast CT Kidney cancer
AUC90.8
4
Density ClassificationStructured Report Generation dataset
ACC63.7
4
Showing 10 of 18 rows

Other info

Follow for update