| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Theorem Proving | LeanDojo (random) | Pass@153.21 | 16 | |
| Theorem Proving | LeanDojo (novel premises) | Pass@141.11 | 12 | |
| Premise selection | LeanDojo Benchmark 4 Lean 3 (novel_premises) | R@19.8 | 6 | |
| Premise Selection | LeanDojo Benchmark (random) | R@113.5 | 5 | |
| Theorem Proving | LeanDojo Benchmark 4 Lean 3 (random) | Pass@148.6 | 2 |