STEM Reasoning on GPQA (Pass@1)
[Chart: Pass@1 Accuracy over time. Highlighted result: GPT-OSS-120B, 77.06 Pass@1 Accuracy, Apr 10, 2026. Updated 5d ago.]
Evaluation Results
| Method | Configuration | Date | Pass@1 Accuracy |
| --- | --- | --- | --- |
| GPT-OSS-120B | Sampling Strategy=4-sa... | 2026.04 | 77.06 |
| GPT-5 Mini | Sampling Strategy=4-sa... | 2026.04 | 75.46 |
| Gemini 2.5 Flash | Sampling Strategy=4-sa... | 2026.04 | 75.09 |
| Aryabhata 2 | Sampling Strategy=4-sa... | 2026.04 | 74.86 |
| Qwen3-30B-A3B (Thinking) | Sampling Strategy=4-sa... | 2026.04 | 73.31 |
| GPT-OSS-20B | Sampling Strategy=4-sa... | 2026.04 | 70.51 |
| Nemotron 3 Nano 30B A3B | Sampling Strategy=4-sa... | 2026.04 | 65.38 |
| GPT-5 Nano | Sampling Strategy=4-sa... | 2026.04 | 61.11 |