Mathematics on Minerva Math
Top result: INSIGHT, 41.8 Pass@1 Accuracy (Mar 2, 2026; updated 1mo ago)
[Chart: Pass@1 Accuracy over time]
Evaluation Results
Method               Model               Date     Pass@1 Accuracy
INSIGHT              QWEN3-4B            2026.03  41.8
MoPPS                QWEN3-4B            2026.03  41.59
INVERSE-EVIDENCE     QWEN3-4B            2026.03  41.49
RANDOM               QWEN3-4B            2026.03  41.13
INSIGHT              R1-DISTILL-QWEN-7B  2026.03  39.06
DS                   R1-DISTILL-QWEN-7B  2026.03  38.39
EXPECTED-DIFFICULTY  R1-DISTILL-QWEN-7B  2026.03  38.39
MoPPS                R1-DISTILL-QWEN-7B  2026.03  38.02
RANDOM               R1-DISTILL-QWEN-7B  2026.03  37.98
INVERSE-EVIDENCE     R1-DISTILL-QWEN-7B  2026.03  37.93
INSIGHT              QWEN3-0.6B          2026.03  21.94
INVERSE-EVIDENCE     QWEN3-0.6B          2026.03  21.7
RANDOM               QWEN3-0.6B          2026.03  21.6
MoPPS                QWEN3-0.6B          2026.03  21.6