R2GenGPT: Radiology Report Generation with Frozen LLMs

About

Large Language Models (LLMs) have consistently showcased remarkable generalization capabilities when applied to various language tasks. Nonetheless, harnessing the full potential of LLMs for Radiology Report Generation (R2Gen) still presents a challenge, stemming from the inherent disparity in modality between LLMs and the R2Gen task. To bridge this gap effectively, we propose R2GenGPT, which is a novel solution that aligns visual features with the word embedding space of LLMs using an efficient visual alignment module. This innovative approach empowers the previously static LLM to seamlessly integrate and process image information, marking a step forward in optimizing R2Gen performance. R2GenGPT offers the following benefits. First, it attains state-of-the-art (SOTA) performance by training only the lightweight visual alignment module while freezing all the parameters of LLM. Second, it exhibits high training efficiency, as it requires the training of an exceptionally minimal number of parameters while achieving rapid convergence. By employing delta tuning, our model only trains 5M parameters (which constitute just 0.07\% of the total parameter count) to achieve performance close to the SOTA levels. Our code is available at https://github.com/wang-zhanyu/R2GenGPT.

Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou• 2023

Related benchmarks

Task	Dataset	Result
Radiology Report Generation	MIMIC-CXR (test)	ROUGE-L0.297	209
Radiology Report Generation	IU-Xray (test)	ROUGE-L0.376	110
Medical Report Generation	MIMIC-CXR (test)	ROUGE-L0.285	100
Radiology Report Generation	CheXpert Plus (test)	Precision0.315	88
Medical Report Generation	IU-Xray (test)	ROUGE-L0.376	56
Radiology Report Generation	CT-RATE (test)	BL-10.166	49
Medical Report Generation	MIMIC-CXR	BLEU-40.134	43
Medical Report Generation	IU X-Ray (test)	BLEU-417.3	41
CT Report Generation	CTRG-Chest-548K (test)	BLEU-430.1	40
Radiology Report Generation	CHEXPERT Plus	R-L0.266	37

Showing 10 of 35 rows

Other info

Follow for update

@wizwand_team Discord