Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Aggregate Performance on Average All Benchmarks
Loading...
71
Accuracy
Dense
30.856
41.278
51.7
62.122
Jul 29, 2025
Accuracy
Throughput
Updated 19d ago
Evaluation Results
Method
Method
Links
Accuracy
Throughput
Dense
Model=QwQ 32B
2025.07
71
121.07
Dense
Model=Phi 4 reasoning...
2025.07
69.8
269.86
ReasonCache
Model=QwQ 32B
2025.07
69.3
192.85
Dense
Model=DeepSeek R1 Dist...
2025.07
68.6
162.82
ReasonCache
Model=Phi 4 reasoning...
2025.07
68.6
386.7
ReasonCache
Model=DeepSeek R1 Dist...
2025.07
66.5
254.53
Random
Model=DeepSeek R1 Dist...
2025.07
36.7
251.15
Random
Model=QwQ 32B
2025.07
36.4
192.4
Random
Model=Phi 4 reasoning...
2025.07
32.4
385.05
Feedback
Search any
task
Search any
task