Question Answering on GPQA Diamond (Accuracy and Performance)
[Chart: Throughput (Req/s), with Speedup and Accuracy (%) as alternate series. Labeled point: SQUEEZE EVOLVE (p=0), 86.3, Apr 9, 2026.]
Updated 9d ago
Evaluation Results
| Method | Configuration | Links | Throughput (Req/s) | Speedup | Accuracy (%) |
|---|---|---|---|---|---|
| SQUEEZE EVOLVE (p=0) | Model 1=GPT-OSS-20B, M... | 2026.04 | 86.3 | 2.84 | 79 |
| SQUEEZE EVOLVE (p=10) | Model 1=GPT-OSS-20B, M... | 2026.04 | 53.54 | 1.76 | 79.5 |
| RSA | Model 1=GPT-OSS-20B, M... | 2026.04 | 30.34 | 1 | 79.6 |
| SQUEEZE EVOLVE (p=0) | Model 1=Qwen3-30B-A3B-... | 2026.04 | 22 | 10.71 | 83.8 |
| SQUEEZE EVOLVE (p=10) | Model 1=Qwen3-30B-A3B-... | 2026.04 | 8.17 | 3.98 | 84 |
| RSA | Model 1=Qwen3-30B-A3B-... | 2026.04 | 2.05 | 1 | 84.3 |
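The page does not say how the Speedup column is derived. The figures in the evaluation results above are consistent with speedup being each method's throughput divided by the RSA throughput for the same model configuration. The sketch below checks that reading. The grouping by "Model 1" prefix, the dictionary layout, and the `speedups` helper are illustrative assumptions, not part of the leaderboard.

```python
# Throughput (Req/s) values copied from the evaluation results.
# Assumption: rows sharing the same (truncated) "Model 1=..." prefix form one
# group, and RSA is the baseline within each group.
throughput = {
    "GPT-OSS-20B": {
        "SQUEEZE EVOLVE (p=0)": 86.3,
        "SQUEEZE EVOLVE (p=10)": 53.54,
        "RSA": 30.34,
    },
    "Qwen3-30B-A3B-...": {
        "SQUEEZE EVOLVE (p=0)": 22.0,
        "SQUEEZE EVOLVE (p=10)": 8.17,
        "RSA": 2.05,
    },
}


def speedups(group, baseline="RSA"):
    """Return throughput of each method relative to the baseline method."""
    base = group[baseline]
    return {method: value / base for method, value in group.items()}


if __name__ == "__main__":
    for model, group in throughput.items():
        for method, ratio in speedups(group).items():
            print(f"{model:18s} {method:22s} {ratio:.2f}x")
```

Under this assumption the computed ratios agree with the listed Speedup values to within rounding. The one visible difference is the Qwen3 p=0 row, where 22 / 2.05 is about 10.73 against a listed 10.71, which would be explained if the displayed throughput of 22 is itself rounded.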