Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Auto-formalization on MathOlympiad-Bench
Loading...
99.2
Pass@8
LongCat-Flash-Prover
30.664
48.457
66.25
84.043
Mar 22, 2026
Pass@8
Updated 25d ago
Evaluation Results
Method
Method
Links
Pass@8
LongCat-Flash-Prover
w/ TIR=true
2026.03
99.2
Claude-Opus-4.5
2026.03
94.4
LongCat-Flash-Prover
2026.03
93.3
Gemini-3 Pro
2026.03
93.1
Kimi-K2.5
2026.03
91.1
Goedel-V2-Formalizer-32B
2026.03
89.2
DeepSeek-V3.2
2026.03
85.6
ATF-32B
2026.03
83.6
StepFun-Formalizer-32B
2026.03
78.6
ATF-8B-Distilled
2026.03
76.7
Goedel-V2-Formalizer-8B
2026.03
73.2
StepFun-Formalizer-7B
2026.03
71.9
Kimina-Autoformalizer-7B
2026.03
33.3
Feedback
Search any
task
Search any
task