Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HiMTM: Hierarchical Multi-Scale Masked Time Series Modeling with Self-Distillation for Long-Term Forecasting

About

Time series forecasting is a critical and challenging task in practical application. Recent advancements in pre-trained foundation models for time series forecasting have gained significant interest. However, current methods often overlook the multi-scale nature of time series, which is essential for accurate forecasting. To address this, we propose HiMTM, a hierarchical multi-scale masked time series modeling with self-distillation for long-term forecasting. HiMTM integrates four key components: (1) hierarchical multi-scale transformer (HMT) to capture temporal information at different scales; (2) decoupled encoder-decoder (DED) that directs the encoder towards feature extraction while the decoder focuses on pretext tasks; (3) hierarchical self-distillation (HSD) for multi-stage feature-level supervision signals during pre-training; and (4) cross-scale attention fine-tuning (CSA-FT) to capture dependencies between different scales for downstream tasks. These components collectively enhance multi-scale feature extraction in masked time series modeling, improving forecasting accuracy. Extensive experiments on seven mainstream datasets show that HiMTM surpasses state-of-the-art self-supervised and end-to-end learning methods by a considerable margin of 3.16-68.54\%. Additionally, HiMTM outperforms the latest robust self-supervised learning method, PatchTST, in cross-domain forecasting by a significant margin of 2.3\%. The effectiveness of HiMTM is further demonstrated through its application in natural gas demand forecasting.

Shubao Zhao, Ming Jin, Zhaoxiang Hou, Chengyi Yang, Zengxiang Li, Qingsong Wen, Yi Wang• 2024

Related benchmarks

TaskDatasetResultRank
Multivariate long-term forecastingETTh1
MSE0.355
344
Multivariate long-term series forecastingETTh2
MSE0.332
319
Multivariate long-term series forecastingWeather
MSE0.22
288
Multivariate long-term series forecastingETTm1
MSE0.336
257
Multivariate long-term forecastingElectricity
MSE0.157
183
Multivariate long-term series forecastingETTm2
MSE0.253
175
Multivariate long-term forecastingTraffic
MSE0.384
159
Multivariate long-term forecastingETTm1 (test)
MSE0.28
134
Multivariate long-term forecastingETTh1 (test)
MSE0.358
77
Multivariate long-term forecastingETTh1 T=720 (test)
MSE0.425
51
Showing 10 of 19 rows

Other info

Follow for update