RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining
About
Developing advanced medical imaging retrieval systems is challenging due to the varying definitions of `similar images' across different medical contexts. This challenge is compounded by the lack of large-scale, high-quality medical imaging retrieval datasets and benchmarks. In this paper, we propose a novel methodology that leverages dense radiology reports to define image-wise similarity ordering at multiple granularities in a scalable and fully automatic manner. Using this approach, we construct two comprehensive medical imaging retrieval datasets: MIMIC-IR for Chest X-rays and CTRATE-IR for CT scans, providing detailed image-image ranking annotations conditioned on diverse anatomical structures. Furthermore, we develop two retrieval systems, RadIR-CXR and model-ChestCT, which demonstrate superior performance in traditional image-image and image-report retrieval tasks. These systems also enable flexible, effective image retrieval conditioned on specific anatomical structures described in text, achieving state-of-the-art results on 77 out of 78 metrics.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anatomy-conditioned Image Retrieval | MIMIC-IR official (test) | Recall@330.65 | 44 | |
| Conditional Image Retrieval | CTRATE-IR (test) | Recall@389.29 | 34 | |
| Unconditional Image-to-Image Retrieval | MIMIC-IR Chest X-Ray | Recall@55.18 | 4 | |
| Unconditional Image-to-Report Retrieval | MIMIC-IR Chest X-Ray | Recall@54.33 | 4 | |
| Unconditional Image-to-Image Retrieval | CTRATE-IR Chest CT | Recall@520.75 | 2 | |
| Unconditional Image-to-Report Retrieval | CTRATE-IR Chest CT | Recall@56.65 | 2 |