Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Reasoning on GPQA (Accuracy, Loss)
Loading...
25.8
Accuracy
Adam
24.968
25.184
25.4
25.616
Apr 10, 2026
Accuracy
Loss
Updated 6d ago
Evaluation Results
Method
Method
Links
Accuracy
Loss
Adam
Model scale=1B
2026.04
25.8
2.28
Nexus
Model scale=3B
2026.04
25.8
2.047
Nexus
Model scale=1B
2026.04
25
2.261
Adam
Model scale=3B
2026.04
25
2.062
Feedback
Search any
task
Search any
task