Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AIMO3 competition
Loading...
43
Score
STOP
38.84
39.92
41
42.08
Apr 17, 2026
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
STOP
base_model=GPT-OSS-120...
2026.04
43
STOP
base_model=GPT-OSS-120...
2026.04
42
Baseline + Tool
base_model=GPT-OSS-120B
2026.04
39
Feedback
Search any
task
Search any
task