Time Series Reasoning on TimeSeriesExam
[Chart: Similarity Score by date. Highlighted point: GPT-4o, 0.78, Apr 11, 2026. Selectable metrics: Similarity Score, Pattern Recognition Score, Causality Score, Noise Handling Score, Anomaly Detection Score, Overall Score.]
Updated 5d ago
Evaluation Results
Method          Parameters  Date     Similarity Score  Pattern Recognition Score  Causality Score  Noise Handling Score  Anomaly Detection Score  Overall Score
GPT-4o          -           2026.04  0.78              0.78                       0.61             0.77                  0.54                     0.73
Gemini-2.5-Pro  -           2026.04  0.78              0.76                       0.57             0.65                  0.59                     0.71
Gemma-3-27B-IT  27B         2026.04  0.62              0.59                       0.51             0.71                  0.51                     0.59
Qwen2.5-VL-72B  72B         2026.04  0.6               0.45                       0.49             0.4                   0.22                     0.44
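The page does not say how the Overall Score is derived from the five category scores. As a rough sanity check, the sketch below transcribes the results above, ranks the methods by Overall Score, and compares each reported Overall Score with the plain unweighted mean of the five categories. The names `RESULTS`, `unweighted_mean`, `rank_by_overall`, and `overall_minus_mean` are hypothetical helpers introduced here for illustration; they are not part of the benchmark.

```python
# Scores transcribed from the Evaluation Results above.
# Category order: similarity, pattern recognition, causality,
# noise handling, anomaly detection.
RESULTS = {
    "GPT-4o":         {"scores": [0.78, 0.78, 0.61, 0.77, 0.54], "overall": 0.73},
    "Gemini-2.5-Pro": {"scores": [0.78, 0.76, 0.57, 0.65, 0.59], "overall": 0.71},
    "Gemma-3-27B-IT": {"scores": [0.62, 0.59, 0.51, 0.71, 0.51], "overall": 0.59},
    "Qwen2.5-VL-72B": {"scores": [0.60, 0.45, 0.49, 0.40, 0.22], "overall": 0.44},
}


def unweighted_mean(scores):
    """Plain arithmetic mean of the category scores (an assumption, not the site's formula)."""
    return sum(scores) / len(scores)


def rank_by_overall(results):
    """Method names sorted from highest to lowest reported Overall Score."""
    return sorted(results, key=lambda name: results[name]["overall"], reverse=True)


def overall_minus_mean(results):
    """Reported Overall Score minus the unweighted category mean, per method."""
    return {
        name: row["overall"] - unweighted_mean(row["scores"])
        for name, row in results.items()
    }


if __name__ == "__main__":
    gaps = overall_minus_mean(RESULTS)
    for name in rank_by_overall(RESULTS):
        mean = unweighted_mean(RESULTS[name]["scores"])
        print(f"{name:<16} overall={RESULTS[name]['overall']:.2f} "
              f"mean={mean:.3f} gap={gaps[name]:+.3f}")
```

For the two highest-ranked methods the reported Overall Score sits a few hundredths above the unweighted mean, so the Overall Score is probably not a simple average of the five categories. The page gives no further detail, so any specific explanation of the difference would be a guess.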