| MiniF2F (val) | InternLM2-StepProver | Success Rate 63.9 | | 59 | 3mo ago |
| Putnam-Bench | Seed-Prover 1.5 | Pass@32 87.9 | | 29 | 27d ago |
| miniF2F Lean (test) | DeepSeekMath-Base | Pass@64 52 | | 24 | 3mo ago |
| PutnamBench Lean | | Solved Rate 668 | | 23 | 2mo ago |
| Prover-Bench | LongCat-Flash-Prover | Pass@32 70.8 | | 19 | 2mo ago |
| MathOlympiad-Bench | LongCat-Flash-Prover | Pass@32 46.7 | | 16 | 2mo ago |
| LeanDojo (random) | LeanListener | Pass@1 53.21 | | 16 | 3mo ago |
| ProofNet (test) | LongCat-Flash-Prover | Pass@32 47.3 | | 15 | 2mo ago |
| set.mm (test) | HOLOPHRASM + MetaGen-IL | Proofs Found (Test) 600 | | 14 | 3mo ago |
| PutnamBench (test) | Hilbert | Accuracy 72 | | 13 | 9d ago |
| CombiBench | | Proof Length 8 | | 13 | 1mo ago |
| ProverBench | | Proof Length 5.5 | | 13 | 1mo ago |
| MO-INT | | Proof Length 17 | | 13 | 1mo ago |
| CombiBench Combinatorics | DreamProver | Solved Problems 27 | | 13 | 1mo ago |
| ProverBench Number Theory | DreamProver | Solved Problems 25 | | 13 | 1mo ago |
| PutnamBench Number Theory | DreamProver | Solved Problems 19 | | 13 | 1mo ago |
| MO-INT | DreamProver | Solved Problems 17 | | 13 | 1mo ago |
| ChenNEQ | DreamProver | Solved Problems 36 | | 13 | 1mo ago |
| 567NEQ | DreamProver | Solved Problems 57 | | 13 | 1mo ago |
| TheoremQA | InternLM2-20B | Accuracy 13.5 | | 13 | 3mo ago |
| LCI (test) | WZ-LLM | Success Rate 34 | | 12 | 27d ago |
| LeanDojo (novel premises) | LeanListener | Pass@1 41.11 | | 12 | 3mo ago |
| ProofNet (val) | DeepSeek-Prover-V1.5-RL + RMaxTS | Accuracy 25.4 | | 11 | 3mo ago |
| miniF2F Lean (val) | DeepSeekMath-Base | Cumulative Pass Rate 60.2 | | 10 | 3mo ago |
| PutnamBench | DreamProver | Average Proof Length 62.5 | | 9 | 1mo ago |