Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Modal Multi-Agent Reinforcement Learning for Radiology Report Generation: Radiologist-Like Workflow with Clinically Verifiable Rewards

About

We propose MARL-Rad, a novel multi-modal multi-agent reinforcement learning framework for radiology report generation that coordinates region-specific agents and a global integrating agent, optimized via clinically verifiable rewards. Unlike prior single-model reinforcement learning or post-hoc agentization of independently trained models, our method jointly trains multiple agents and optimizes the entire agent system through reinforcement learning. Experiments on the MIMIC-CXR and IU X-ray datasets show that MARL-Rad consistently improves clinically efficacy (CE) metrics such as RadGraph, CheXbert, and GREEN scores, achieving state-of-the-art CE performance. Further analyses confirm that MARL-Rad enhances laterality consistency and produces more accurate, detail-informed reports.

Kaito Baba, Satoshi Kodera• 2026

Related benchmarks

TaskDatasetResultRank
Radiology Report GenerationMIMIC-CXR findings
BLEU-45.6
26
Radiology Report GenerationIU X-ray Findings
BLEU-44.6
21
Radiology Report GenerationMIMIC-CXR Findings + Impression
BLEU-414.2
6
Radiology Report GenerationIU X-ray Findings + Impression
BLEU-40.182
3
Showing 4 of 4 rows

Other info

Follow for update