Benchmarks
General Performance on Aggregate Across Math, Code, Chat
[Chart: Speedup and Tau by method. Highlighted result: DFlash, Speedup 4.91 (Feb 5, 2026).]
Updated 1mo ago
Evaluation Results
| Method  | Configuration             | Links   | Speedup | Tau  |
|---------|---------------------------|---------|---------|------|
| DFlash  | Model=Q3-4B, Temperatu... | 2026.02 | 4.91    | 6.54 |
| DFlash  | Model=Q3-8B, Temperatu... | 2026.02 | 4.86    | 6.49 |
| DFlash  | Model=Q3-4B, Temperatu... | 2026.02 | 4.24    | 5.69 |
| DFlash  | Model=Q3-8B, Temperatu... | 2026.02 | 4.03    | 5.48 |
| EAGLE-3 | Model=Q3-4B, Temperatu... | 2026.02 | 2.08    | 3.48 |
| EAGLE-3 | Model=Q3-8B, Temperatu... | 2026.02 | 2.02    | 3.4  |
| EAGLE-3 | Model=Q3-4B, Temperatu... | 2026.02 | 1.93    | 3.36 |
| EAGLE-3 | Model=Q3-8B, Temperatu... | 2026.02 | 1.88    | 3.26 |
| EAGLE-3 | Model=Q3-4B, Temperatu... | 2026.02 | 1.81    | 3.05 |
| EAGLE-3 | Model=Q3-8B, Temperatu... | 2026.02 | 1.76    | 2.96 |
| EAGLE-3 | Model=Q3-4B, Temperatu... | 2026.02 | 1.72    | 2.95 |
| EAGLE-3 | Model=Q3-8B, Temperatu... | 2026.02 | 1.68    | 2.83 |
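The Speedup and Tau figures listed above can be compared per method with a few lines of Python. The following is a minimal sketch, not part of the benchmark itself: the `ROWS` list and the `summarize` helper are hypothetical names introduced here, the values are copied from the listing, and the per-row configuration strings are left out because they are truncated in the source. The rows of each method come from different configurations, so the means are a rough descriptive summary only.

```python
from collections import defaultdict

# (method, speedup, tau) copied from the results listing above.
# Configuration strings are truncated in the source, so they are omitted here.
ROWS = [
    ("DFlash", 4.91, 6.54),
    ("DFlash", 4.86, 6.49),
    ("DFlash", 4.24, 5.69),
    ("DFlash", 4.03, 5.48),
    ("EAGLE-3", 2.08, 3.48),
    ("EAGLE-3", 2.02, 3.40),
    ("EAGLE-3", 1.93, 3.36),
    ("EAGLE-3", 1.88, 3.26),
    ("EAGLE-3", 1.81, 3.05),
    ("EAGLE-3", 1.76, 2.96),
    ("EAGLE-3", 1.72, 2.95),
    ("EAGLE-3", 1.68, 2.83),
]


def summarize(rows):
    """Return {method: (best_speedup, mean_speedup, mean_tau)}."""
    grouped = defaultdict(list)
    for method, speedup, tau in rows:
        grouped[method].append((speedup, tau))

    summary = {}
    for method, values in grouped.items():
        speedups = [s for s, _ in values]
        taus = [t for _, t in values]
        summary[method] = (
            max(speedups),
            sum(speedups) / len(speedups),
            sum(taus) / len(taus),
        )
    return summary


if __name__ == "__main__":
    for method, (best, mean_speedup, mean_tau) in summarize(ROWS).items():
        print(f"{method}: best={best:.2f} mean_speedup={mean_speedup:.2f} mean_tau={mean_tau:.2f}")
```

Grouping by method alone is a deliberate simplification; with the full configuration strings available, grouping by (method, model, temperature) would give a fairer like-for-like comparison.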