Heavy-Tailed Diffusion Models
About
Diffusion models achieve state-of-the-art generation quality across many applications, but their ability to capture rare or extreme events in heavy-tailed distributions remains unclear. In this work, we show that traditional diffusion and flow-matching models with standard Gaussian priors fail to capture heavy-tailed behavior. We address this by repurposing the diffusion framework for heavy-tail estimation using multivariate Student-t distributions. We develop a tailored perturbation kernel and derive the denoising posterior based on the conditional Student-t distribution for the backward process. Inspired by $\gamma$-divergence for heavy-tailed distributions, we derive a training objective for heavy-tailed denoisers. The resulting framework introduces controllable tail generation using only a single scalar hyperparameter, making it easily tunable for diverse real-world distributions. As specific instantiations of our framework, we introduce t-EDM and t-Flow, extensions of existing diffusion and flow models that employ a Student-t prior. Remarkably, our approach is readily compatible with standard Gaussian diffusion models and requires only minimal code changes. Empirically, we show that our t-EDM and t-Flow outperform standard diffusion models in heavy-tail estimation on high-resolution weather datasets in which generating rare and extreme events is crucial.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Extreme-Event Time-Series Generation | LTST-ECG | EM-W10.0331 | 10 | |
| Time-series generation | WEA-TEMP | Wasserstein Distance1.0998 | 10 | |
| Time-series generation | LTST-ECG | Wasserstein Distance0.0443 | 10 | |
| Extreme-Event Generation | WEA-TEMP (test) | EM-W10.4115 | 10 | |
| Extreme-Event Time-Series Generation | HH-Power | EM-W10.8059 | 10 | |
| Time-series generation | Syn-Data | Wasserstein Distance0.0231 | 10 | |
| Time-series generation | WEA Prec | Wasserstein Distance158.3 | 10 | |
| Time-series generation | PEMS-SF | EM (W1)0.0217 | 10 | |
| Time-series generation | HH-Power | Wasserstein Distance0.1856 | 10 | |
| Time-series generation | PEMS-SF (test) | Wasserstein Distance0.0051 | 10 |