brat: Aligned Multi-View Embeddings for Brain MRI Analysis

About

We present brat (brain report alignment transformer), a multi-view representation learning framework for brain magnetic resonance imaging (MRI) trained on MRIs paired with clinical reports. Brain MRIs present unique challenges due to the presence of numerous, highly varied, and often subtle abnormalities that are localized to a few slices within a 3D volume. To address these challenges, we introduce a brain MRI dataset $10\times$ larger than existing ones, containing approximately 80,000 3D scans with corresponding radiology reports, and propose a multi-view pre-training approach inspired by advances in document retrieval. We develop an implicit query-feature matching mechanism and adopt concepts from quality-diversity to obtain multi-view embeddings of MRIs that are aligned with the clinical features given by report sentences. We evaluate our approach across multiple vision-language and vision tasks, demonstrating substantial performance improvements. The brat foundation models are publicly released.

Maxime Kayser, Maksim Gridnev, Wanting Wang, Max Bain, Aneesh Rangnekar, Avijit Chatterjee, Aleksandr Petrov, Harini Veeraraghavan, Nathaniel C. Swinburne • 2025

Related benchmarks

Task                       Dataset                     Result        Rank
Text-to-Image Retrieval    BIMCV-R                     R@1: 3        10
Image-to-Text Retrieval    MSKBrain                    R@1: 20.1     9
Text-to-Image Retrieval    MSKBrain                    R@1: 20.5     9
Image-to-Text Retrieval    BIMCV-R lung CT (test)      R@1: 0.036    6
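
The results above are reported as R@1 (Recall@1), the share of queries whose correct match is ranked first. As a minimal sketch of how such a metric is commonly computed, and not the authors' evaluation code, the snippet below assumes a square similarity matrix in which entry [i][j] scores query i against candidate j and the correct candidate for query i is candidate i. The function name `recall_at_k` and the toy scores are hypothetical. The function returns a fraction in [0, 1]; the listing does not state whether its values are fractions or percentages.

```python
from typing import Sequence


def recall_at_k(similarity: Sequence[Sequence[float]], k: int = 1) -> float:
    """Fraction of queries whose paired candidate appears in the top-k ranking.

    Assumption: similarity[i][j] scores query i against candidate j, and the
    ground-truth match for query i is candidate i (one-to-one pairing).
    """
    hits = 0
    for i, row in enumerate(similarity):
        # Rank candidate indices from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(similarity)


# Toy scores (hypothetical): rows are text queries, columns are images.
sim = [
    [0.9, 0.1, 0.0],
    [0.2, 0.3, 0.8],
    [0.1, 0.7, 0.6],
]

text_to_image_r1 = recall_at_k(sim, k=1)
text_to_image_r2 = recall_at_k(sim, k=2)

# Image-to-text retrieval uses the same scores with the roles swapped,
# i.e. the transposed matrix.
sim_t = [list(col) for col in zip(*sim)]
image_to_text_r1 = recall_at_k(sim_t, k=1)
```

In this toy case only the first query ranks its own match first, so both R@1 values are 1/3, while R@2 for text-to-image is 1.0.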