Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

IndicMedDialog: A Parallel Multi-Turn Medical Dialogue Dataset for Accessible Healthcare in Indic Languages

About

Most existing medical dialogue systems operate in a single-turn question--answering paradigm or rely on template-based datasets, limiting conversational realism and multilingual applicability. We introduce IndicMedDialog, a parallel multi-turn medical dialogue dataset spanning English and nine Indic languages: Assamese, Bengali, Gujarati, Hindi, Marathi, Punjabi, Tamil, Telugu, and Urdu. The dataset extends MDDial with LLM-generated synthetic consultations, translated using TranslateGemma, verified by native speakers, and refined through a script-aware post-processing pipeline to correct phonetic, lexical, and character-spacing errors. Building on this dataset, we fine-tune IndicMedLM via parameter-efficient adaptation of a quantized small language model, incorporating optional patient pre-context to personalise multi-turn symptom elicitation. We evaluate against zero-shot multilingual baselines, conduct systematic error analysis across ten languages, and validate clinical plausibility through medical expert evaluation.

Shubham Kumar Nigam, Suparnojit Sarkar, Piyush Patel• 2026

Related benchmarks

TaskDatasetResultRank
Medical DiagnosisIndicMedDialog English 1.0 (test)
Diagnostic Accuracy80.85
8
Medical DiagnosisIndicMedDialog Hindi 1.0 (test)
Diagnostic Accuracy72.76
8
Medical DiagnosisIndicMedDialog Marathi 1.0 (test)
Diagnostic Accuracy68.51
8
Medical DiagnosisIndicMedDialog Bengali 1.0 (test)
Diagnostic Accuracy58.72
8
Medical DiagnosisIndicMedDialog Urdu 1.0 (test)
Diagnostic Accuracy28.51
8
Medical DiagnosisIndicMedDialog Punjabi 1.0 (test)
Diagnostic Accuracy20.42
8
Medical DiagnosisIndicMedDialog Telugu 1.0 (test)
Diagnostic Accuracy5.96
8
Medical DiagnosisIndicMedDialog Gujarati 1.0 (test)
Diagnostic Accuracy0.1957
8
Medical DiagnosisIndicMedDialog Assamese 1.0 (test)
Diagnostic Accuracy5.96
8
Medical DiagnosisIndicMedDialog Tamil 1.0 (test)
Diagnostic Accuracy6.8
8
Showing 10 of 10 rows

Other info

GitHub

Follow for update