| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Theorem Proving | PutnamBench Lean | Solved Rate668 | 23 | |
| Formal Theorem Proving | PutnamBench | Solve Rate87.9 | 14 | |
| Formal Theorem Proving | PutnamBench September 2025 | Solved Problems Count462 | 11 | |
| Mathematical formalization | PutnamBench 672 problems | C@163 | 8 | |
| Autoformalization | PutnamBench (PB) | Mean Cycle Consistency0.561 | 6 |