Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation

About

We propose a novel Adaptive patch-word Matching (AdaMatch) model that correlates chest X-ray (CXR) image regions with words in medical reports, and we apply it to CXR-report generation to provide explainability for the generation process. AdaMatch exploits the fine-grained relation between adaptive patches and words to explain specific image regions with corresponding words. To capture abnormal regions of varying sizes and positions, we introduce the Adaptive Patch extraction (AdaPatch) module, which acquires adaptive patches for these regions. To provide explicit explainability for the CXR-report generation task, we propose an AdaMatch-based bidirectional large language model for Cyclic CXR-report generation (AdaMatch-Cyclic). It employs AdaMatch to obtain keywords for CXR images and "keypatches" for medical reports as hints to guide CXR-report generation. Extensive experiments on two publicly available CXR datasets demonstrate the effectiveness of our method and its superior performance over existing methods.
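The idea of matching patches to words can be illustrated with a toy sketch. This is not the authors' implementation: AdaMatch learns its adaptive patches and alignment end to end, whereas the snippet below assumes precomputed patch and word embeddings in a shared space and simply uses cosine similarity with an argmax. All function names, embeddings, and words here are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_matrix(patch_embs, word_embs):
    """sim[i][j] = similarity between patch i and word j."""
    return [[cosine(p, w) for w in word_embs] for p in patch_embs]


def keywords_for_patches(sim, words):
    """For each patch, pick the best-matching word (a 'keyword')."""
    return [words[max(range(len(row)), key=row.__getitem__)] for row in sim]


def keypatches_for_words(sim):
    """For each word, pick the index of the best-matching patch (a 'keypatch')."""
    n_patches, n_words = len(sim), len(sim[0])
    return [max(range(n_patches), key=lambda i: sim[i][j]) for j in range(n_words)]


# Toy 2-D embeddings (assumed values, not taken from the paper).
patch_embs = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
words = ["effusion", "cardiomegaly"]
word_embs = [[0.9, 0.1], [0.2, 0.8]]

sim = similarity_matrix(patch_embs, word_embs)
keywords = keywords_for_patches(sim, words)
keypatches = keypatches_for_words(sim)
```

In this simplified picture, the keywords would serve as hints when generating a report from an image, and the keypatches as hints when generating an image from a report.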

Wenting Chen, Linlin Shen, Jingyang Lin, Jiebo Luo, Xiang Li, Yixuan Yuan • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Radiology Report Generation | MIMIC-CXR (test) | BLEU-4: 0.106 | 121 |
| Report Generation | MIMIC-CXR (test) | BLEU-4: 0.106 | 20 |
| CXR-to-report generation | OPENI (test) | BLEU-1: 0.4161 | 18 |
| CXR-to-Report Retrieval | MIMIC-CXR | Recall@1: 51.47 | 9 |
| Report-to-CXR Retrieval | MIMIC-CXR | Recall@1: 51.18 | 9 |
| Report-to-CXR Generation | MIMIC-CXR | FID: 1.0916 | 6 |
| Report-to-CXR Generation | OpenI | FID: 1.5938 | 6 |
