| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Putnam-Bench | LongCat-Flash-Prover | Pass@898.1 | 13 | 25d ago | |
| Prover-Bench | LongCat-Flash-Prover | Pass@8100 | 13 | 25d ago | |
| ProofNet (test) | LongCat-Flash-Prover | Pass@897.9 | 13 | 25d ago | |
| MiniF2F (test) | LongCat-Flash-Prover | Pass@8100 | 13 | 25d ago | |
| MathOlympiad-Bench | LongCat-Flash-Prover | Pass@899.2 | 13 | 25d ago | |
| FormalMath-Lite | LongCat-Flash-Prover | Pass@899.8 | 13 | 25d ago | |
| CombiBench | LongCat-Flash-Prover | Pass@897 | 13 | 25d ago |