TreeText-CTS: Compact, Source-Traceable Tree-Path Evidence for Irregular Clinical Time-Series Prediction

About

Numerical time-series models can effectively process irregular electronic health record (EHR) trajectories, but they do not naturally expose the measurements and temporal patterns supporting each risk estimate as readable evidence. Existing text-based interfaces improve readability, but typically rely on either raw serialization, which is lengthy and redundant, or patient-level free-form summaries, which are difficult to trace to source measurements and time windows. To bridge this gap, we introduce TreeText-CTS (Clinical Time-Series), which converts irregular EHR trajectories into human-readable, compact, source-traceable tree-path evidence units without patient-level summarization or inference-time autoregressive decoding. TreeText-CTS routes multi-scale window summaries through frozen XGBoost models and verbalizes activated tree paths as deterministic, source-traceable evidence units composed of threshold conditions. An evidence selector assembles an informative subset of these units, which a language-model encoder then integrates for prediction. Across PhysioNet 2012 mortality, MIMIC-III mortality, and PhysioNet 2019 sepsis-onset forecasting, TreeText-CTS achieves the best AUROC and AUPRC among evaluated text-based EHR time-series interfaces, improving AUPRC by 6.0 to 9.7 absolute percentage points over the strongest prior text-based interface while remaining competitive with numerical time-series models. Ablations show that tree-path evidence construction, evidence selection, and language-model composition each contribute to performance. Because every span passed to the language-model encoder is constructed from activated tree-path threshold conditions, TreeText-CTS makes the evidence supplied to the final predictor inspectable and source-traceable.
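The pipeline described above (window summaries, tree routing, path verbalization) can be illustrated with a minimal sketch. This is not the authors' implementation: the tree is hand-written rather than taken from a frozen XGBoost model, and the observations, feature names, thresholds, window lengths, and the rule that missing features route left are all assumptions made for illustration.

```python
from statistics import mean

# Hypothetical irregular observations for one patient:
# (hours since admission, variable name, value).
observations = [
    (1.5, "lactate", 1.8),
    (7.0, "lactate", 2.9),
    (20.0, "lactate", 4.1),
    (22.5, "heart_rate", 118.0),
    (23.0, "lactate", 4.5),
]


def window_summaries(obs, end_hour, window_hours):
    """Summarize each variable over several look-back windows ending at end_hour."""
    features = {}
    for w in window_hours:
        by_var = {}
        for t, var, val in obs:
            if end_hour - w <= t <= end_hour:
                by_var.setdefault(var, []).append(val)
        for var, vals in by_var.items():
            features[f"{var}_mean_last_{w}h"] = mean(vals)
            features[f"{var}_max_last_{w}h"] = max(vals)
    return features


# Hand-written toy tree standing in for one tree of a frozen gradient-boosted model.
# Internal nodes split on "feature < threshold" (left) versus ">=" (right).
toy_tree = {
    "feature": "lactate_max_last_6h",
    "threshold": 4.0,
    "left": {"leaf": -0.3},
    "right": {
        "feature": "heart_rate_mean_last_24h",
        "threshold": 110.0,
        "left": {"leaf": 0.2},
        "right": {"leaf": 0.7},
    },
}


def activated_path(tree, features):
    """Route one feature vector through the tree and collect the threshold conditions met."""
    conditions = []
    node = tree
    while "leaf" not in node:
        name, thr = node["feature"], node["threshold"]
        value = features.get(name)
        if value is None:
            # Assumption: a feature with no observations in its window routes left.
            conditions.append(f"{name} is missing")
            node = node["left"]
        elif value < thr:
            conditions.append(f"{name} < {thr}")
            node = node["left"]
        else:
            conditions.append(f"{name} >= {thr}")
            node = node["right"]
    return conditions, node["leaf"]


features = window_summaries(observations, end_hour=24.0, window_hours=(6, 24))
conditions, leaf = activated_path(toy_tree, features)

# Deterministic text: the same trajectory always yields the same evidence unit.
evidence_unit = " AND ".join(conditions)
print(evidence_unit)
```

Each condition names a variable, a summary statistic, and a look-back window, so the resulting text can be traced back to the measurements that produced it. In the described method, a selector would pick a subset of such units across many trees before a language-model encoder consumes them; that stage is not shown here.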

Kwanhyung Lee, Juhwan Choi, Jongheon Kim, Joohyung Lee, Hyeongwon Jang, Eunho Yang • 2026

Related benchmarks

Task	Dataset	Result
Clinical prediction	MIMIC-III	AUROC 85.79	59
Irregular clinical time-series prediction	PhysioNet 2012 (test)	AUROC 0.8571	29
Clinical time series prediction	MIMIC-III (test)	AUROC 85.79	18
Clinical time series prediction	PhysioNet 2019 (test)	AUROC 0.9066	18
Sepsis Forecasting	PhysioNet 2019	AUROC 0.9066	17
Clinical Outcome Prediction	PhysioNet 2012	AUROC 0.8571	17
Irregularly Sampled Multivariate Time Series (ISMTS) classification	MIMIC-III (test)	AUROC 85.79	9
Irregularly Sampled Multivariate Time Series (ISMTS) classification	PhysioNet 2019 (test)	AUROC 90.66	9
Irregularly Sampled Multivariate Time Series (ISMTS) classification	PhysioNet 2012 (test)	AUROC 0.8571	9

