Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UniDiff: A Unified Diffusion Framework for Multimodal Time Series Forecasting

About

As multimodal data proliferates across diverse real-world applications, leveraging heterogeneous information such as texts and timestamps for accurate time series forecasting (TSF) has become a critical challenge. While diffusion models demonstrate exceptional performance in generation tasks, their application to TSF remains largely confined to modeling single-modality numerical sequences, overlooking the abundant cross-modal signals inherent in complex heterogeneous data. To address this gap, we propose UniDiff, a unified diffusion framework for multimodal time series forecasting. To process the numerical sequence, our framework first tokenizes the time series into patches, preserving local temporal dynamics by mapping each patch to an embedding space via a lightweight MLP. At its core lies a unified and parallel fusion module, where a single cross-attention mechanism adaptively weighs and integrates structural information from timestamps and semantic context from texts in one step, enabling a flexible and efficient interplay between modalities. Furthermore, we introduce a novel classifier-free guidance mechanism designed for multi-source conditioning, allowing for decoupled control over the guidance strength of textual and temporal information during inference, which significantly enhances model robustness. Extensive experiments on real-world benchmark datasets across eight domains demonstrate that the proposed UniDiff model achieves state-of-the-art performance.

Da Zhang, Bingyu Li, Zhuyuan Zhao, Junyu Gao, Feiping Nie, Xuelong Li• 2025

Related benchmarks

TaskDatasetResultRank
Multimodal Time Series ForecastingTimeMMD Agriculture domain: Retail Broiler Composite 1.0 (test)
MSE0.128
10
Multimodal Time Series ForecastingTimeMMD Climate domain: Drought Level 1.0 (test)
MSE1.049
3
Multimodal Time Series ForecastingTimeMMD Economy domain: International Trade Balance 1.0 (test)
MSE0.015
3
Multimodal Time Series ForecastingTimeMMD Energy domain: Gasoline Prices 1.0 (test)
MSE0.102
3
Multimodal Time Series ForecastingTimeMMD Social Good domain: Unemployment Rate 1.0 (test)
MSE0.729
3
Multimodal Time Series ForecastingTimeMMD Environment domain: Air Quality Index 1.0 (test)
MSE0.284
3
Multimodal Time Series ForecastingTimeMMD Health domain: Influenza Patients Proportion 1.0 (test)
MSE1.002
2
Multimodal Time Series ForecastingTimeMMD Traffic domain: Travel Volume 1.0 (test)
MSE0.179
2
Showing 8 of 8 rows

Other info

Follow for update