| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Formal Theorem Proving | mathlib (val) | Pass@162.6 | 9 | |
| Proof Optimization (Length) | Mathlib | Improvement6.19 | 4 | |
| Formal Theorem Proving | mathlib (test) | Pass@163 | 3 | |
| Proof Optimization (Declarativity) | Mathlib | Improvement4.63 | 2 |