
M4

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Short-term forecasting | M4 Quarterly | MASE: 0.062 | 141 |
| Short-term forecasting | M4 Monthly | MASE: 0.709 | 125 |
| Short-term forecasting | M4 Yearly | MASE: 0.091 | 116 |
| Short-term forecasting | M4 (Others) | SMAPE: 3.789 | 83 |
| Short-term forecasting | M4 | SMAPE: 4.633 | 74 |
| Short-term time series forecasting | M4 Average | SMAPE: 11.73 | 53 |
| Short-term forecasting | M4 (test) | SMAPE: 11.701 | 35 |
| Short-term forecasting | M4 Weighted Average | SMAPE: 11.618 | 32 |
| Machine-Generated Text Detection | M4 | TP @ 20%: 87.28 | 32 |
| Time Series Forecasting | M4 Daily | MASE: 2.054 | 31 |
| Short-term time series forecasting | M4 weighted average from all datasets (test) | sMAPE: 11.829 | 30 |
| Time Series Forecasting | M4 Weekly | MASE: 0.354 | 17 |
| Multi-contrast MRI Reconstruction | M4raw | PSNR (dB): 32.01 | 16 |
| AI-generated text detection | M4 | AUROC: 92.43 | 13 |
| Forecasting | M4 hourly 48 | Relative MAPE: 0.54 | 13 |
| Forecasting | M4 | CRPS: 0.036 | 13 |
| Time Series Forecasting | M4 Hourly | SMAPE: 10.84 | 12 |
| Time Series Forecasting | m4 Chronos Benchmark II (yearly) | WQL: 0.091 | 12 |
| Time Series Forecasting | m4 Chronos Benchmark II (quarterly) | WQL: 0.062 | 12 |
| Time-series Forecasting | M4 (test) | CRPS: 0.03 | 12 |
| Time-series Forecasting | M4 Zero-shot | Yearly sMAPE: 13.53 | 10 |
| Time Series Forecasting | M4 Others | sMAPE: 6.245 | 10 |
| Univariate Time Series Forecasting | M4 | SMAPE: 11.863 | 10 |
| Detection of LLM-generated text | M4 extension Generation: Polish 4o-mini, Regeneration: Para 4o-mini | ROC AUC @ 1% FPR: 0.1329 | 8 |
| Detection of LLM-generated text | M4 extension Generation Revise 4.1 Regeneration Para 4o-mini | ROC AUC @ 1% FPR: 25.26 | 8 |
Showing 25 of 58 rows