Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Reasoning on GPQA D (Accuracy, Loss)
Loading...
27.3
Accuracy
Adam
18.46
20.755
23.05
25.345
Apr 10, 2026
Accuracy
Loss
Updated 6d ago
Evaluation Results
Method
Method
Links
Accuracy
Loss
Adam
Model scale=3B
2026.04
27.3
1.975
Nexus
Model scale=3B
2026.04
23.4
1.957
Adam
Model scale=1B
2026.04
18.8
2.191
Nexus
Model scale=1B
2026.04
18.8
2.172
Feedback
Search any
task
Search any
task