Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment

About

High-quality time series (TS) data are essential for ensuring TS model performance, rendering research on rating TS data quality indispensable. Existing methods have shown promising rating accuracy within individual domains, primarily by extending data quality rating techniques such as influence functions and Shapley values to account for temporal characteristics. However, they neglect the fact that real-world TS data can span vastly different domains and exhibit distinct properties, hampering the accurate and efficient rating of diverse TS data. In this paper, we propose TSRating, a novel and unified framework for rating the quality of time series data crawled from diverse domains. TSRating leverages LLMs' inherent ample knowledge, acquired during their extensive pretraining, to comprehend and discern quality differences in diverse TS data. We verify this by devising a series of prompts to elicit quality comparisons from LLMs for pairs of TS samples. We then fit a dedicated rating model, termed TSRater, to convert the LLMs' judgments into efficient quality predictions by inferring future TS samples through TSRater's inference. To ensure cross-domain adaptability, we develop a meta-learning scheme to train TSRater on quality comparisons collected from nine distinct domains. To improve training efficiency, we employ signSGD for inner-loop updates, thus circumventing the demanding computation of hypergradients. Extensive experimental results on eleven benchmark datasets across three time series tasks, each using both conventional TS models and TS foundation models, demonstrate that TSRating outperforms baselines in terms of estimation accuracy, efficiency, and domain adaptability.

Shunyu Wu, Dan Li, Wenjie Feng, Haozheng Ye, Jian Lou, See-Kiong Ng• 2025

Related benchmarks

TaskDatasetResultRank
Short-term forecastingM4 Monthly--
125
Short-term forecastingM4 Yearly--
116
Long-term forecastingTraffic--
65
Time-series classificationHandwriting
Accuracy21.3
62
Time Series ForecastingM4 Daily--
31
Long-term forecastingWeather
RMSE0.433
30
Long-term forecastingExchange Rate (ExRate)
RMSE0.213
30
Anomaly DetectionMSL (test)
F1 Score85.06
20
Time Series ForecastingTraffic
RMSE0.339
20
ClassificationMedicalImages (MImg)
Accuracy57.2
18
Showing 10 of 25 rows

Other info

Follow for update