Tree of Concepts: Interpretable Continual Learners in Non-Stationary Clinical Domains
About
Continual learning aims to update models under distribution shift without forgetting, yet many high-stakes deployments, such as healthcare, also require interpretability. In practice, models that adapt well (e.g., deep networks) are often opaque, while models that are interpretable (e.g., decision trees) are brittle under shift, making it difficult to achieve both properties simultaneously. In response, we propose Tree of Concepts, an interpretable continual learning framework that uses a shallow decision tree to define a fixed, rule-based concept interface and trains a concept bottleneck model to predict these concepts from raw features. Continual updates act on the concept extractor and label head while keeping concept semantics stable over time, yielding explanations that do not drift across sequential updates. On multiple tabular healthcare benchmarks under continual learning protocols, our method achieves a stronger stability-plasticity trade-off than existing baselines, including replay-enhanced variants. Our results suggest that structured concept interfaces can support continual adaptation while preserving a consistent audit interface in non-stationary, high-stakes domains.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3-class classification | CDC Diabetes Health Indicators | Macro-F1 (Full-data UB)0.66 | 7 | |
| In-hospital mortality prediction | MIMIC-III continual protocol A (time-window shift by admission year) | Full-data UB Score0.86 | 7 | |
| In-hospital mortality prediction | MIMIC-III continual protocol (B) demographic shift | AUROC (Full-data UB)0.85 | 7 | |
| Binary Classification | UCI Heart Disease | Full-data UB (AUROC)0.89 | 7 |