PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting
About
Precipitation nowcasting is vital for flood warning, agricultural management, and emergency response, yet two bottlenecks persist: the prohibitive cost of modeling million-scale spatiotemporal tokens from multi-variate atmospheric fields, and the extreme long-tailed rainfall distribution where heavy-to-torrential events -- those of greatest societal impact -- constitute fewer than 0.1% of all samples. We propose the Precipitation-Adaptive Network (PA-Net), a Transformer framework whose computational budget is explicitly governed by rainfall intensity. Its core component, Precipitation-Adaptive MoE (PA-MoE), dynamically scales the number of activated experts per token according to local precipitation magnitude, channeling richer representational capacity toward the rare yet critical heavy-rainfall tail. A Dual-Axis Compressed Latent Attention mechanism factorizes spatiotemporal attention with convolutional reduction to manage massive context lengths, while an intensity-aware training protocol progressively amplifies learning signals from extreme-rainfall samples. Experiment on ERA5 demonstrate consistent improvements over state-of-the-art baselines, with particularly significant gains in heavy-rain and rainstorm regimes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Precipitation nowcasting | ERA5 | Temporal Score24.62 | 98 | |
| Precipitation nowcasting | ERA5 0~1h forecast window | IoU53.78 | 13 | |
| Precipitation nowcasting | ERA5 1~2h forecast window | IoU49.94 | 13 | |
| Precipitation nowcasting | ERA5 2~3h forecast window | IoU45.58 | 13 | |
| Precipitation nowcasting | ERA5 3~4h forecast window | IoU43.72 | 13 | |
| Precipitation nowcasting | ERA5 4~5h forecast window | IoU40.49 | 13 | |
| Precipitation nowcasting | ERA5 5~6h forecast window | IoU37.57 | 13 |