Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining

About

Developing advanced medical imaging retrieval systems is challenging due to the varying definitions of `similar images' across different medical contexts. This challenge is compounded by the lack of large-scale, high-quality medical imaging retrieval datasets and benchmarks. In this paper, we propose a novel methodology that leverages dense radiology reports to define image-wise similarity ordering at multiple granularities in a scalable and fully automatic manner. Using this approach, we construct two comprehensive medical imaging retrieval datasets: MIMIC-IR for Chest X-rays and CTRATE-IR for CT scans, providing detailed image-image ranking annotations conditioned on diverse anatomical structures. Furthermore, we develop two retrieval systems, RadIR-CXR and model-ChestCT, which demonstrate superior performance in traditional image-image and image-report retrieval tasks. These systems also enable flexible, effective image retrieval conditioned on specific anatomical structures described in text, achieving state-of-the-art results on 77 out of 78 metrics.

Tengfei Zhang, Ziheng Zhao, Chaoyi Wu, Xiao Zhou, Ya Zhang, Yanfeng Wang, Weidi Xie• 2025

Related benchmarks

TaskDatasetResultRank
Anatomy-conditioned Image RetrievalMIMIC-IR official (test)
Recall@330.65
44
Conditional Image RetrievalCTRATE-IR (test)
Recall@389.29
34
Unconditional Image-to-Image RetrievalMIMIC-IR Chest X-Ray
Recall@55.18
4
Unconditional Image-to-Report RetrievalMIMIC-IR Chest X-Ray
Recall@54.33
4
Unconditional Image-to-Image RetrievalCTRATE-IR Chest CT
Recall@520.75
2
Unconditional Image-to-Report RetrievalCTRATE-IR Chest CT
Recall@56.65
2
Showing 6 of 6 rows

Other info

Follow for update