Vision Foundation Models for Computed Tomography

About

Foundation models (FMs) have shown transformative potential in radiology by performing diverse, complex tasks across imaging modalities. Here, we developed CT-FM, a large-scale 3D image-based pre-trained model designed explicitly for various radiological tasks. CT-FM was pre-trained using 148,000 computed tomography (CT) scans from the Imaging Data Commons through label-agnostic contrastive learning. We evaluated CT-FM across four categories of tasks, namely whole-body and tumor segmentation, head CT triage, medical image retrieval, and semantic understanding, and it showed superior performance against state-of-the-art models. Beyond quantitative success, CT-FM demonstrated the ability to cluster regions anatomically and identify similar anatomical and structural concepts across scans. Furthermore, it remained robust across test-retest settings, and saliency analysis of its embeddings highlighted reasonable regions. This study demonstrates the value of large-scale medical imaging foundation models and, by open-sourcing the model weights, code, and data, aims to support more adaptable, reliable, and interpretable AI solutions in radiology.

Suraj Pai, Ibrahim Hadzic, Dennis Bontempi, Keno Bressem, Benjamin H. Kann, Andriy Fedorov, Raymond H. Mak, Hugo J. W. L. Aerts • 2025

Related benchmarks

Task	Dataset	Metric	Result	
Medical Image Segmentation	MSD Pancreas (test)	DSC	81.8	30
Multi-label abnormality classification	RAD-ChestCT (test)	AUROC	0.6849	26
Multi-label Abnormality Analysis	CT-RATE (test)	AUROC	0.7435	24
Disease Classification	CT-RATE (test)	AUC	76.08	16
3D Segmentation	MSD-Liver	DSC	93.4	15
Visual Segmentation	KiTS23	KTC Dice Score	0.969	14
Abnormality Detection	Merlin-Abd-CT	AUROC	63.92	12
Abnormality Detection	EXT-Chest-CT	AUROC	63.01	12
Abnormality Detection	Avg EXT	AUROC	61.79	12
Finding Classification	Merlin-Abd-CT (test)	AUROC	0.7041	12

Showing 10 of 21 rows
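
For readers unfamiliar with the DSC (Dice similarity coefficient) reported for the segmentation rows, the following is a minimal sketch of how the metric is commonly computed on binary 3D masks. It is not taken from the CT-FM code base; the function name `dice_score` and the toy masks are illustrative assumptions, and benchmark implementations may differ in details such as per-class averaging or the handling of empty masks. Note that the table mixes scales: some results are fractions (for example 0.969) and others are percentages (for example 81.8).

```python
import numpy as np


def dice_score(pred, target):
    """Dice similarity coefficient between two binary masks (hypothetical helper).

    DSC = 2 * |pred AND target| / (|pred| + |target|), in the range [0, 1].
    """
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    if pred.shape != target.shape:
        raise ValueError("pred and target must have the same shape")

    intersection = np.logical_and(pred, target).sum()
    denom = pred.sum() + target.sum()
    if denom == 0:
        # Assumption: two empty masks count as a perfect match.
        return 1.0
    return float(2.0 * intersection / denom)


# Toy 3D example: an 8-voxel prediction fully inside a 12-voxel target.
pred = np.zeros((4, 4, 4), dtype=bool)
pred[1:3, 1:3, 1:3] = True      # 2 * 2 * 2 = 8 voxels
target = np.zeros((4, 4, 4), dtype=bool)
target[1:3, 1:3, 1:4] = True    # 2 * 2 * 3 = 12 voxels

score = dice_score(pred, target)  # 2 * 8 / (8 + 12) = 0.8
print(f"DSC = {score:.3f} ({100 * score:.1f}%)")
```

Multiplying the fraction by 100 gives the percentage form used in rows such as MSD Pancreas and MSD-Liver.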
