METransformer: Radiology Report Generation by Transformer with Multiple Learnable Expert Tokens

About

In clinical scenarios, multi-specialist consultation can significantly benefit diagnosis, especially for intricate cases. This inspires us to explore a "multi-expert joint diagnosis" mechanism to upgrade the "single expert" framework commonly seen in the current literature. To this end, we propose METransformer, a method that realizes this idea with a transformer-based backbone. The key design of our method is the introduction of multiple learnable "expert" tokens into both the transformer encoder and decoder. In the encoder, each expert token interacts with both vision tokens and other expert tokens to learn to attend to different image regions for image representation. These expert tokens are encouraged to capture complementary information by an orthogonal loss that minimizes their overlap. In the decoder, each attended expert token guides the cross-attention between input words and visual tokens, thus influencing the generated report. A metrics-based expert voting strategy is further developed to generate the final report. Through the multi-expert concept, our model enjoys the merits of an ensemble-based approach, but in a manner that is computationally more efficient and supports more sophisticated interactions among experts. Experimental results demonstrate the promising performance of our proposed model on two widely used benchmarks. Last but not least, the framework-level innovation makes our work ready to incorporate advances in existing "single-expert" models to further improve its performance.
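The two mechanisms named in the abstract, the orthogonal loss and the metrics-based voting, can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the exact form of the loss or name the voting metric, so the penalty below (mean squared cosine similarity between distinct expert vectors) and the `unigram_f1` stand-in metric are assumptions chosen for illustration, and all function names are hypothetical.

```python
import math


def orthogonality_penalty(experts):
    """Mean squared cosine similarity between distinct expert vectors.

    Assumed form: the abstract only says the loss "minimizes their overlap".
    """
    # L2-normalise each expert embedding so a dot product is a cosine.
    unit = []
    for vec in experts:
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        unit.append([x / norm for x in vec])
    n = len(unit)
    if n < 2:
        return 0.0
    total = 0.0
    for i in range(n):
        for j in range(n):
            if i != j:
                cos = sum(a * b for a, b in zip(unit[i], unit[j]))
                total += cos * cos
    return total / (n * (n - 1))


def unigram_f1(candidate, reference):
    """Stand-in similarity metric (assumption): F1 over unique lowercase tokens."""
    c, r = set(candidate.lower().split()), set(reference.lower().split())
    overlap = len(c & r)
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(c), overlap / len(r)
    return 2 * precision * recall / (precision + recall)


def vote(reports, metric=unigram_f1):
    """Return the index of the report that agrees most with the others."""
    scores = []
    for i, cand in enumerate(reports):
        # Score each expert's report against every other expert's report.
        others = [metric(cand, ref) for j, ref in enumerate(reports) if j != i]
        scores.append(sum(others) / len(others) if others else 0.0)
    # Ties resolve to the lowest index.
    return max(range(len(reports)), key=scores.__getitem__)


# Orthogonal experts incur no penalty; identical experts incur the maximum.
print(orthogonality_penalty([[1.0, 0.0], [0.0, 1.0]]))
print(orthogonality_penalty([[1.0, 0.0], [1.0, 0.0]]))

# Hypothetical reports from three experts; the outlier should not win the vote.
reports = [
    "no acute cardiopulmonary process",
    "no acute cardiopulmonary abnormality",
    "large pleural effusion",
]
print(vote(reports))
```

In this sketch the penalty would be added to the training objective so that expert tokens drift toward attending to different image regions, while the vote runs only at inference time, once each expert has produced its own candidate report.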

Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Radiology Report Generation | MIMIC-CXR (test) | BLEU-4 | 0.124 | 121 |
| Radiology Report Generation | IU-Xray (test) | ROUGE-L | 0.38 | 55 |
| Medical Report Generation | MIMIC-CXR | BLEU-4 | 0.124 | 43 |
| Medical Report Generation | MIMIC-CXR (test) | ROUGE-L | 0.291 | 39 |
| Medical Report Generation | IU-Xray (test) | ROUGE-L | 0.38 | 34 |
| Radiology Report Generation | MIMIC-CXR | ROUGE-L | 29.1 | 32 |
| Medical Report Generation | MIMIC-CXR | F1 Score | 31.1 | 22 |
| Medical Report Generation | IU X-Ray | BLEU-1 | 0.483 | 21 |
| Radiology Report Generation | Chest (test) | B4 | 0.172 | 13 |
