Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA

About

Spoken question-answering (SQA) systems relying on automatic speech recognition (ASR) often struggle with accurately recognizing medical terminology. To this end, we propose MedSpeak, a novel knowledge graph-aided ASR error correction framework that refines noisy transcripts and improves downstream answer prediction by leveraging both semantic relationships and phonetic information encoded in a medical knowledge graph, together with the reasoning power of LLMs. Comprehensive experimental results on benchmarks demonstrate that MedSpeak significantly improves the accuracy of medical term recognition and overall medical SQA performance, establishing MedSpeak as a state-of-the-art solution for medical SQA. The code is available at https://github.com/RainieLLM/MedSpeak.

Yutong Song, Shiva Shrestha, Chenhan Lyu, Elahe Khatibi, Pengfei Zhang, Honghui Xu, Nikil Dutt, Amir Rahmani• 2026

Related benchmarks

TaskDatasetResultRank
Spoken Medical Question AnsweringMMLU (test)
Clinical95.4
5
Spoken Medical Question AnsweringMedQA (test)
QA Accuracy97.5
5
Spoken Medical Question AnsweringMedMCQA (test)
QA Accuracy91.5
5
ASR Error CorrectionMedQA (test)
WER43.5
3
ASR Error CorrectionMedMCQA (test)
WER28.1
3
ASR Error CorrectionMMLU (test)
Clinical Accuracy22.1
3
Showing 6 of 6 rows

Other info

Follow for update