CAST: Causal Anchored Simplex Transport for Distribution-Valued Time Series
About
Many decision-facing stochastic systems are observed through aggregate distributions rather than scalar trajectories: queue occupancies, mobility shares, public-health mixtures, generation-source shares, ecological compositions, and air-quality severity profiles all live on the probability simplex and evolve over time. We study causal (online) forecasting for these distribution-valued time series and argue that the transition operator itself should be structured around the simplex. We introduce CAST (Causal Anchored Simplex Transport), a successor-local operator that (i) retrieves empirical successors from causal context, (ii) stabilizes them with a persistence anchor, and (iii) applies a bounded local stochastic transport on ordered supports; every stage preserves the simplex by construction. We identify a structural failure mode, latent transition-kernel aliasing, where similar observed distributions evolve differently under different contextual regimes, and prove that any forecaster depending only on an aliased summary incurs an irreducible weighted Jensen-Shannon excess-risk lower bound, while the CAST hypothesis class contains the regime-aware Bayes successor; for ordered supports an additional Pinsker separation holds whenever the transported successor lies outside the no-transport anchor hull. On eleven public and simulated benchmarks spanning ecology, energy, diet, mortality, employment, air quality, severe weather, mobility, and G/G/1, G_t/G/1 queue occupancy, CAST attains the best average rank on both one-step KL (1.27) and autoregressive rollout JSD (1.91), winning 8/11 sections on each metric against a broad statistical, compositional, recurrent, convolutional, and Transformer baseline set, and top-2 on all 11 sections for offline KL. Component ablations and a controlled synthetic aliasing experiment corroborate the theory.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | 11 benchmark sections (BioTIME, Ember Monthly, OWID Dietary, CDC Weekly Deaths, BLS QCEW, EPA AirData AQI, NOAA Storm Events, NYC TLC Trip, Queue Homogeneous, Queue Nonhomogeneous, Queue Combined) (average across sections) | KLavg1.27 | 16 | |
| Time-series modeling | BioTIME | Offline KL0.1313 | 2 | |
| Time-series modeling | Ember Monthly | Offline KL0.0434 | 2 | |
| Time-series modeling | OWID Dietary | Offline KL0.0091 | 2 | |
| Time-series modeling | CDC Weekly Deaths | Offline KL Divergence0.0108 | 2 | |
| Time-series modeling | BLS QCEW | Offline KL8.77e-4 | 2 | |
| Time-series modeling | NYC TLC Trip | Offline KL0.0217 | 2 | |
| Time-series modeling | Queue Nonhomogeneous | Offline KL0.071 | 2 | |
| Time-series modeling | Queue Combined | Offline KL0.0075 | 2 | |
| Time-series modeling | EPA AirData AQI | Offline KL Divergence0.1184 | 2 |