Explanation Generation on TSQueryBench (test)
Chart (Linear Spike Score): Qwen 3 8B, 0.94, Apr 2, 2026
Updated 2mo ago
Evaluation Results
Method                          Qwen 3 8B   LLaMA 3.1 8B   Gemma 2 9B
Model size                      8B          8B             9B
Date                            2026.04     2026.04        2026.04
Linear Spike Score              0.94        0.7            0.5
Seasonal Drop Score             0.82        0.12           0
Structural Break Score          0.96        0.94           0.9
Multi-Metric Consistency Score  0.7         0.46           0.4
Relative Extremum Score         0.08        0.36           0.35
Mean Shift Score                0.96        0.28           0.15
Volatility Shift Score          0           0              0