Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

KiUT: Knowledge-injected U-Transformer for Radiology Report Generation

About

Radiology report generation aims to automatically generate a clinically accurate and coherent paragraph from the X-ray image, which could relieve radiologists from the heavy burden of report writing. Although various image caption methods have shown remarkable performance in the natural image field, generating accurate reports for medical images requires knowledge of multiple modalities, including vision, language, and medical terminology. We propose a Knowledge-injected U-Transformer (KiUT) to learn multi-level visual representation and adaptively distill the information with contextual and clinical knowledge for word prediction. In detail, a U-connection schema between the encoder and decoder is designed to model interactions between different modalities. And a symptom graph and an injected knowledge distiller are developed to assist the report generation. Experimentally, we outperform state-of-the-art methods on two widely used benchmark datasets: IU-Xray and MIMIC-CXR. Further experimental results prove the advantages of our architecture and the complementary benefits of the injected knowledge.

Zhongzhen Huang, Xiaofan Zhang, Shaoting Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Radiology Report GenerationMIMIC-CXR (test)
BLEU-40.113
172
Radiology Report GenerationIU-Xray (test)
ROUGE-L0.409
77
Radiology Report GenerationMIMIC-CXR
ROUGE-L28.5
57
Medical Report GenerationMIMIC-CXR
BLEU-40.113
43
Radiology Report GenerationMIMIC-CXR findings
BLEU-411.3
26
Medical Report GenerationIU X-Ray
BLEU-10.525
21
Radiology Report GenerationIU X-ray Findings
BLEU-418.5
21
Showing 7 of 7 rows

Other info

Follow for update