Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ClinAlign: Scaling Healthcare Alignment from Clinician Preference

About

Although large language models (LLMs) demonstrate expert-level medical knowledge, aligning their open-ended outputs with fine-grained clinician preferences remains challenging. Existing methods often rely on coarse objectives or unreliable automated judges that are weakly grounded in professional guidelines. We propose a two-stage framework to address this gap. First, we introduce HealthRubrics, a dataset of 7,034 physician-verified preference examples in which clinicians refine LLM-drafted rubrics to meet rigorous medical standards. Second, we distill these rubrics into HealthPrinciples: 119 broadly reusable, clinically grounded principles organized by clinical dimensions, enabling scalable supervision beyond manual annotation. We use HealthPrinciples for (1) offline alignment by synthesizing rubrics for unlabeled queries and (2) an inference-time tool for guided self-revision. A 30B-A3B model trained with our framework achieves 33.4% on HealthBench-Hard, outperforming much larger models including Deepseek-R1 and o3, establishing a resource-efficient baseline for clinical alignment.

Shiwei Lyu, Xidong Wang, Lei Liu, Hao Zhu, Chaohe Zhang, Jian Wang, Jinjie Gu, Benyou Wang, Yue Shen• 2026

Related benchmarks

TaskDatasetResultRank
Creative WritingArena-Hard Creative Writing v2
Score79.4
25
General Instruction FollowingArena-Hard v2
Score74.6
23
Medical Question AnsweringHealthBench Hard
Score33.4
16
Medical Question AnsweringHealthBench Overall
Overall Score59.5
16
Medical LLM EvaluationLLMEval Med
Reasoning Score59.8
16
Showing 5 of 5 rows

Other info

GitHub

Follow for update