Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation

About

Foundation models for time series analysis (TSA) have attracted significant attention. However, challenges such as data scarcity and data imbalance continue to hinder their development. To address this, we consider modeling complex systems through symbolic expressions that serve as semantic descriptors of time series. Building on this concept, we introduce a series-symbol (S2) dual-modulity data generation mechanism, enabling the unrestricted creation of high-quality time series data paired with corresponding symbolic representations. Leveraging the S2 dataset, we develop SymTime, a pre-trained foundation model for TSA. SymTime demonstrates competitive performance across five major TSA tasks when fine-tuned with downstream task, rivaling foundation models pre-trained on real-world datasets. This approach underscores the potential of dual-modality data generation and pretraining mechanisms in overcoming data scarcity and enhancing task performance.

Wenxuan Wang, Kai Wu, Yujian Betterest Li, Dan Wang, Xiaoyu Zhang, Jing Liu• 2025

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionSMD
F1 Score84.62
359
Anomaly DetectionPSM
F1 Score97.07
142
Anomaly DetectionMSL
Precision89.46
95
Anomaly DetectionSMAP
Precision90.34
13
Showing 4 of 4 rows

Other info

Follow for update