Math Reasoning on AIME 24 (Pass@8)
[Chart: Pass@8 Score. Leading method: RFT, 86.7 (Apr 13, 2026)]
Updated 4d ago
Evaluation Results
Method              Details                   Date     Pass@8 Score
RFT                 Base Model=Qwen3-4B-In... 2026.04  86.7
SRT                 Base Model=Qwen3-4B-In... 2026.04  86.7
SD-ZERO             Base Model=Qwen3-4B-In... 2026.04  86.7
SFT                 Base Model=Qwen3-4B-In... 2026.04  83.3
SFT                 Base Model=Olmo-3-7B-I... 2026.04  83.3
SRT                 Base Model=Olmo-3-7B-I... 2026.04  83.3
SD-ZERO             Base Model=Olmo-3-7B-I... 2026.04  83.3
GRPO                Base Model=Qwen3-4B-In... 2026.04  80
SDFT                Base Model=Qwen3-4B-In... 2026.04  80
Olmo-3-7B-Instruct  Base Model=Olmo-3-7B-I... 2026.04  80
RFT                 Base Model=Olmo-3-7B-I... 2026.04  80
GRPO                Base Model=Olmo-3-7B-I... 2026.04  80
SDFT                Base Model=Olmo-3-7B-I... 2026.04  80
Qwen3-4B-Instruct   Base Model=Qwen3-4B-In... 2026.04  76.7