Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography
About
Advancements in medical imaging AI, particularly in 3D imaging, have been limited due to the scarcity of comprehensive datasets. We introduce CT-RATE, a public dataset that pairs 3D medical images with corresponding textual reports. CT-RATE comprises 25,692 non-contrast 3D chest CT scans from 21,304 unique patients. Each scan is accompanied by its corresponding radiology report. Leveraging CT-RATE, we develop CT-CLIP, a CT-focused contrastive language-image pretraining framework designed for broad applications without the need for task-specific training. We demonstrate how CT-CLIP can be used in multi-abnormality detection and case retrieval, and outperforms state-of-the-art fully supervised models across all key metrics. By combining CT-CLIP's vision encoder with a pretrained large language model, we create CT-CHAT, a vision-language foundational chat model for 3D chest CT volumes. Finetuned on over 2.7 million question-answer pairs derived from the CT-RATE dataset, CT-CHAT underscores the necessity for specialized methods in 3D medical imaging. Collectively, the open-source release of CT-RATE, CT-CLIP, and CT-CHAT not only addresses critical challenges in 3D medical imaging but also lays the groundwork for future innovations in medical AI and improved patient care.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Modal Visual Question Answering (MMVQA) | CT-RATE (val) | Accuracy23.14 | 57 | |
| Multi-Modal Visual Question Answering (MMVQA) | RAD-ChestCT (val) | Accuracy20.52 | 57 | |
| Conditional Image Retrieval | CTRATE-IR (test) | Recall@376.39 | 34 | |
| Medical Image Re-identification | CCII Lung-CT | CMC-R194.04 | 26 | |
| Medical Image Re-identification | HCC-TACE Abdominal-CT | CMC-R147.62 | 26 | |
| Medical Image Re-identification | LUAD Histopathology | CMC-R142.52 | 26 | |
| Medical Image Re-identification | LIHC Abdominal-CT | CMC-R128.57 | 26 | |
| Medical Image Re-identification | KIRC Abdominal-CT | CMC-R133.14 | 26 | |
| Medical Image Re-identification | OASIS Brain-MRI 2 | CMC-R137.99 | 26 | |
| Medical Image Re-identification | Mess2 Fundus | CMC-R133.14 | 26 |