Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AIME (Pass@1 %)

90Pass@1 Accuracy

GPT-OSS-120B

65.674471.989778.30584.6203Apr 10, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.04
90
2026.04
86.67
2026.04
86.67
2026.04
84.58
2026.04
83.33
2026.04
77.08
2026.04
74.17
2026.04
66.61