Conservative Continuous-Time Treatment Optimization
About
We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Continuous-Time Treatment Optimization | Cancer Explicit | Avg Spearman Correlation0.4 | 3 | |
| Continuous-Time Treatment Optimization | Cancer Relative | Avg Spearman Correlation0.23 | 3 | |
| Treatment Optimization | Cancer Explicit | Average True Cost242.1 | 3 | |
| Treatment Optimization | Cancer Relative | Avg True Cost73.42 | 3 | |
| Treatment Optimization | Covid-19 Target Trajectory | Average True Cost5.61 | 3 | |
| Continuous-Time Treatment Optimization | Covid-19 Target Trajectory | Avg Spearman Correlation0.51 | 3 |