Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Formal Theorem Proving on NuminaMath LEAN (unsolved)
Loading...
26
Accuracy
Ax-Prover
-1.04
5.98
13
20.02
Oct 14, 2025
Accuracy
Updated 9d ago
Evaluation Results
Method
Method
Links
Accuracy
Ax-Prover
2025.10
26
DS-Prover
2025.10
18
Sonnet
2025.10
1
Kimina
evaluation_protocol=pa...
2025.10
0
Feedback
Search any
task
Search any
task