Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Neural Signals Generate Clinical Notes in the Wild

About

Generating clinical reports that summarize abnormal patterns, diagnostic findings, and clinical interpretations from long-term EEG recordings remains labor-intensive. We present CELM, the first clinical EEG-to-Language foundation model capable of summarizing long-duration, variable-length EEG recordings and performing end-to-end clinical report generation at multiple scales. CELM integrates pretrained EEG foundation models with language models to enable scalable multimodal learning. We curate a large-scale clinical EEG dataset containing 9,922 reports paired with approximately 11,000 hours of EEG recordings from 9,048 patients to train CELM, and release the benchmark with an automated report-structuring pipeline to facilitate future research. Experimental results show that CELM consistently outperforms existing methods across all evaluation settings. Importantly, we further conduct human evaluation with clinical experts, demonstrating that CELM generates reports that are more clinically coherent, diagnostically reliable, and better aligned with expert interpretation. We release our model and benchmark construction pipeline at https://github.com/Jathurshan0330/CELM.

Jathurshan Pradeepkumar, Zheng Chen, Jimeng Sun• 2026

Related benchmarks

TaskDatasetResultRank
Clinical Report GenerationS0001 v1 (test)
BLEU-148.23
23
Clinical Report GenerationS0002 v1 (test)
BLEU-10.5695
23
EEG report generationS0001 samples with clinical context
BLEU-10.4823
23
Clinical Report GenerationS0002 with clinical context, Unimodal + Text + EEG Features (test)
BLEU-156.95
12
Clinical Report GenerationS0002
BLEU-10.4652
12
Clinical Report GenerationS0002 (test)
BLEU-10.4652
12
Clinical Report GenerationS0002 with clinical context, Unimodal + Text Only (test)--
11
Showing 7 of 7 rows

Other info

Follow for update