Conservative Continuous-Time Treatment Optimization

About

We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.

Nora Schneider, Georg Manten, Niki Kilbertus• 2026

Related benchmarks

Task	Dataset	Result
Continuous-Time Treatment Optimization	Cancer Explicit	Avg Spearman Correlation0.4	3
Continuous-Time Treatment Optimization	Cancer Relative	Avg Spearman Correlation0.23	3
Treatment Optimization	Cancer Explicit	Average True Cost242.1	3
Treatment Optimization	Cancer Relative	Avg True Cost73.42	3
Treatment Optimization	Covid-19 Target Trajectory	Average True Cost5.61	3
Continuous-Time Treatment Optimization	Covid-19 Target Trajectory	Avg Spearman Correlation0.51	3

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord