Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis

About

Survival analysis concerns the task of predicting the time until an event occurs. Often used in the medical field, survival analysis deals with incomplete (i.e., censored) data, for instance, from patients who did not experience the event during the duration of the study. For practical use, both accuracy and interpretability are important. Survival trees are easy-to-follow survival models that split the patient cohort recursively into discrete patient groups. Whilst survival trees can capture complex relationships, they typically need to grow large, threatening interpretability. Moreover, survival trees are often built using greedy approaches that may overlook globally optimal split combinations, limiting predictive performance. Shallow survival trees require expressive, higher-order feature combinations to achieve competitive accuracy. We therefore use genetic programming to multi-objectively evolve inherently inspectable feature sets and study how they interact with different tree induction strategies. We further introduce an evolutionary approach that jointly optimises the survival tree structure and the non-linear split logic. Our findings demonstrate that evolutionary feature construction improves predictive performance across different tree induction strategies on two real-world datasets and two different survival tree depths. Full joint evolution has the overall highest potential to propose multiple inherently inspectable shallow survival trees of good performance.

Thalea Schlender, Peter A.N. Bosman, Tanja Alderliesten• 2026

Related benchmarks

TaskDatasetResultRank
Survival AnalysisGBSG external (test)
C-index0.684
15
Survival AnalysisMETABRIC external (test)
C-index0.663
15
Showing 2 of 2 rows

Other info

Follow for update