Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Whole Slide Foundation Model for Pathology

About

The field of computational pathology has been transformed with recent advances in foundation models that encode histopathology region-of-interests (ROIs) into versatile and transferable feature representations via self-supervised learning (SSL). However, translating these advancements to address complex clinical challenges at the patient and slide level remains constrained by limited clinical data in disease-specific cohorts, especially for rare clinical conditions. We propose TITAN, a multimodal whole slide foundation model pretrained using 335,645 WSIs via visual self-supervised learning and vision-language alignment with corresponding pathology reports and 423,122 synthetic captions generated from a multimodal generative AI copilot for pathology. Without any finetuning or requiring clinical labels, TITAN can extract general-purpose slide representations and generate pathology reports that generalize to resource-limited clinical scenarios such as rare disease retrieval and cancer prognosis. We evaluate TITAN on diverse clinical tasks and find that TITAN outperforms both ROI and slide foundation models across machine learning settings such as linear probing, few-shot and zero-shot classification, rare cancer retrieval and cross-modal retrieval, and pathology report generation.

Tong Ding, Sophia J. Wagner, Andrew H. Song, Richard J. Chen, Ming Y. Lu, Andrew Zhang, Anurag J. Vaidya, Guillaume Jaume, Muhammad Shaban, Ahrong Kim, Drew F.K. Williamson, Bowen Chen, Cristina Almagro-Perez, Paul Doucet, Sharifa Sahai, Chengkuan Chen, Daisuke Komura, Akihiro Kawabe, Shumpei Ishikawa, Georg Gerber, Tingying Peng, Long Phi Le, Faisal Mahmood• 2024

Related benchmarks

TaskDatasetResultRank
Cancer grading and stagingTCGA-KIRC external (test)
AUC67
14
Cancer grading and stagingTCGA-PRAD External (test)
AUC0.93
14
SubtypingPathobench BRACS subtyping
Balanced Accuracy60.1
13
Cancer grading and stagingTCGA-READ (val)
AUC0.88
13
Mutation PredictionPathobench CPTAC mutation prediction
AUC69.4
13
Cancer grading and stagingTCGA KIRC internal (val)
AUC69
13
Slide-level classificationBRACS
F1 Score63.7
10
Slide-level classificationCPTAC
F1 Score71.4
10
Tile-level classificationCATCH
F1 Score84.4
10
TMB10 predictionLUAD USA1 (test)
AUROC68.97
8
Showing 10 of 20 rows

Other info

Follow for update