Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AgentScore: Autoformulation of Deployable Clinical Scoring Systems

About

Modern clinical practice relies on evidence-based guidelines implemented as compact scoring systems composed of a small number of interpretable decision rules. While machine-learning models achieve strong performance, many fail to translate into routine clinical use due to misalignment with workflow constraints such as memorability, auditability, and bedside execution. We argue that this gap arises not from insufficient predictive power, but from optimizing over model classes that are incompatible with guideline deployment. Deployable guidelines often take the form of unit-weighted clinical checklists, formed by thresholding the sum of binary rules, but learning such scores requires searching an exponentially large discrete space of possible rule sets. We introduce AgentScore, which performs semantically guided optimization in this space by using LLMs to propose candidate rules and a deterministic, data-grounded verification-and-selection loop to enforce statistical validity and deployability constraints. Across eight clinical prediction tasks, AgentScore outperforms existing score-generation methods and achieves AUC comparable to more flexible interpretable models despite operating under stronger structural constraints. On two additional externally validated tasks, AgentScore achieves higher discrimination than established guideline-based scores.

Silas Ruhrberg Est\'evez, Christopher Chiu, Mihaela van der Schaar• 2026

Related benchmarks

TaskDatasetResultRank
Atrial Fibrillation (AF) predictionMIMIC
AUROC0.81
12
Heart Failure (HF) predictionMIMIC
AUROC0.79
12
Vasopressor (Vaso) use predictioneICU
AUROC0.76
12
Acute Kidney Injury (AKI) predictionMIMIC
AUROC79
12
Cancer PredictionMIMIC
AUROC0.59
12
Chronic Obstructive Pulmonary Disease (COPD) predictionMIMIC
AUROC0.63
12
Lung disease predictionMIMIC
AUROC63
12
Length of Stay (LOS) predictioneICU
AUROC67
12
Showing 8 of 8 rows

Other info

Follow for update