Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MV-Swin-T: Mammogram Classification with Multi-view Swin Transformer

About

Traditional deep learning approaches for breast cancer classification has predominantly concentrated on single-view analysis. In clinical practice, however, radiologists concurrently examine all views within a mammography exam, leveraging the inherent correlations in these views to effectively detect tumors. Acknowledging the significance of multi-view analysis, some studies have introduced methods that independently process mammogram views, either through distinct convolutional branches or simple fusion strategies, inadvertently leading to a loss of crucial inter-view correlations. In this paper, we propose an innovative multi-view network exclusively based on transformers to address challenges in mammographic image classification. Our approach introduces a novel shifted window-based dynamic attention block, facilitating the effective integration of multi-view information and promoting the coherent transfer of this information between views at the spatial feature map level. Furthermore, we conduct a comprehensive comparative analysis of the performance and effectiveness of transformer-based models under diverse settings, employing the CBIS-DDSM and Vin-Dr Mammo datasets. Our code is publicly available at https://github.com/prithuls/MV-Swin-T

Sushmita Sarker, Prithul Sarker, George Bebis, Alireza Tavakkoli• 2024

Related benchmarks

TaskDatasetResultRank
Mammogram ClassificationCBIS-DDSM (test)
AUC56.6
24
ClassificationCBIS-DDSM mass
AUC0.721
11
Medical Image ClassificationMURA 2 classes (test)
AUROC0.711
8
Medical Image ClassificationCheXpert 13 classes (test)
AUROC0.882
8
Showing 4 of 4 rows

Other info

Follow for update