Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Forecasting on Spatio-Temporal Synthetic Dataset 1.0 (test)
Loading...
63.36
MAE
Claude-4.5-Sonnet
62.1332
70.4141
78.695
86.9759
Jan 6, 2026
MAE
Updated 4d ago
Evaluation Results
Method
Method
Links
MAE
Claude-4.5-Sonnet
Category=Proprietary M...
2026.01
63.36
Claude-4.5-Sonnet
Category=Proprietary M...
2026.01
63.74
GPT-5.2
Category=Proprietary M...
2026.01
63.99
GPT-5.2
Category=Proprietary M...
2026.01
64.7
STReasoner-8B (Ours)
Category=Spatio-Tempor...
2026.01
65.59
Qwen3-8B - SFT+S-GRPO
Category=Open-Source M...
2026.01
66.35
Qwen3-8B - SFT
Category=Open-Source M...
2026.01
66.49
Qwen3-VL-8B-Instruct - SFT+S-GRPO
Category=Open-Source M...
2026.01
67.29
Time-R1-7B
Category=Time Series R...
2026.01
68.15
Qwen3-VL-8B-Instruct - SFT
Category=Open-Source M...
2026.01
68.53
Qwen3-VL-8B-Instruct
Category=Open-Source M...
2026.01
74.21
Time-MQA-7B
Category=Time Series L...
2026.01
84.7
ChatTS-8B
Category=Time Series L...
2026.01
85.14
Qwen3-8B
Category=Open-Source M...
2026.01
94.03
Feedback
Search any
task
Search any
task