BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval

About

The burgeoning integration of 3D medical imaging into healthcare has led to a substantial increase in the workload of medical professionals. To assist clinicians in their diagnostic processes and alleviate their workload, the development of a robust system for retrieving similar case studies presents a viable solution. While the concept holds great promise, the field of 3D medical text-image retrieval is currently limited by the absence of robust evaluation benchmarks and curated datasets. To remedy this, our study presents a groundbreaking dataset, {BIMCV-R}, which includes an extensive collection of 8,069 3D CT volumes, encompassing over 2 million slices, paired with their respective radiological reports. Expanding upon the foundational work of our dataset, we craft a retrieval strategy, MedFinder. This approach employs a dual-stream network architecture, harnessing the potential of large language models to advance the field of medical image retrieval beyond existing text-image retrieval solutions. It marks our preliminary step towards developing a system capable of facilitating text-to-image, image-to-text, and keyword-based retrieval tasks. Our project is available at \url{https://huggingface.co/datasets/cyd0806/BIMCV-R}.

Yinda Chen, Che Liu, Xiaoyu Liu, Rossella Arcucci, Zhiwei Xiong• 2024

Related benchmarks

Task	Dataset	Result
Text-to-Image Retrieval	BIMCV-R	R@1280	10
Text-to-Image Retrieval	MSKBrain	R@115.5	9
Image-to-Text Retrieval	MSKBrain	R@115.2	9
Image-to-Text Retrieval	BIMCV-R lung CT (test)	R@10.029	6
Image-to-Text Retrieval	BIMCV-R	R@12.9	4
Keyword-based 3D Medical Image Retrieval	BIMCV-R	Atelectasis P@2075	4

Showing 6 of 6 rows

Other info

Code

Follow for update

@wizwand_team Discord