Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalizing Multi-Scale Time-Series Modeling with a Single Operator

About

Multi-scale modeling has emerged as an effective design principle for time-series forecasting by capturing temporal dynamics at multiple resolutions. As no principled foundation has been established in the literature, we unify existing scaling methods into a scaling operator family, revealing a fundamental limitation of existing approaches: reliance on fixed and discrete scaling. To address this limitation, we propose SiGMA (Single Generalized Multi-scale Architecture), which enables distance-aware scaling via the learnable discrete Gaussian (LDG) kernel grounded in scale-space theory. We evaluate SiGMA comprehensively on long- and short-term forecasting benchmarks against state-of-the-art multi-scale baselines. SiGMA outperforms all competitors on both tasks, especially achieving the best performance in 13 out of 16 long-term evaluation settings. Beyond accuracy, SiGMA significantly improves training speed by up to 5.3 times and reduces memory consumption by up to 3.8 times over the strongest competitors. Code is available at https://github.com/cheonwoolee/SiGMA.

Cheonwoo Lee, Dooho Lee, Doyun Choi, Jaemin Yoo• 2026

Related benchmarks

TaskDatasetResultRank
Long-term time-series forecastingETTh1 (test)
MSE0.443
410
Long-term time-series forecastingWeather (test)
MSE0.247
223
Long-term time-series forecastingETTh2 (test)
MSE0.376
216
Long-term time-series forecastingETTm1 (test)
MSE0.383
199
Long-term time-series forecastingTraffic (test)
MSE0.458
182
Short-term forecastingM4 Yearly
MASE2.989
168
Short-term forecastingM4 Quarterly
MASE1.177
166
Short-term forecastingM4 Monthly
MASE0.936
150
Long-term forecastingExchange (test)
MAE0.4
144
Long-term time-series forecastingETTm2 (test)
MSE0.276
134
Showing 10 of 22 rows

Other info

Follow for update