RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance

About

Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology. Such a human-in-the-loop radiology assistant could facilitate a collaborative diagnostic process, thus saving time and improving the quality of reports. Towards this goal, we introduce RaDialog, the first thoroughly evaluated and publicly available large vision-language model for radiology report generation and interactive dialog. RaDialog effectively integrates visual image features and structured pathology findings with a large language model (LLM) while simultaneously adapting it to a specialized domain using parameter-efficient fine-tuning. To keep the conversational abilities of the underlying LLM, we propose a comprehensive, semi-automatically labeled, image-grounded instruct dataset for chest X-ray radiology tasks. By training with this dataset, our method achieves state-of-the-art clinical correctness in report generation and shows impressive abilities in interactive tasks such as correcting reports and answering questions, serving as a foundational step toward clinical dialog systems. Our code is available on github: https://github.com/ChantalMP/RaDialog.

Chantal Pellegrini, Ege \"Ozsoy, Benjamin Busam, Nassir Navab, Matthias Keicher• 2023

Related benchmarks

Task	Dataset	Result
Radiology Report Generation	MIMIC-CXR (test)	ROUGE-L0.271	209
Medical Visual Question Answering	SLAKE (test)	--	67
Medical Report Generation	MIMIC-CXR	BLEU-40.095	43
Medical Visual Question Answering	MIMIC-CXR-VQA (test)	Accuracy0.00e+0	22
Findings Generation	SRRG-Findings unaligned (test)	BLEU1.42	8
Medical Report Generation	MIMIC-CXR All views (test)	BLEU-49.5	8
Findings Generation	SRRG-Findings unaligned (val)	BLEU1.47	4
Impression Generation	SRRG-Impression 1.0 (val)	BLEU5.35	4
Impression Generation	SRRG-Impression 1.0 (test)	BLEU3.32	4
Impression Generation	SRRG-Impression 1.0 (test reviewed)	BLEU3.33	4

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord