Autoformalization and Proving on ProofNet N=186 (test)
[Chart: Pass@64. Top result: 0.7849, Ground truth statement (oracle), Mar 20, 2026. Metrics shown: Pass@64, Complete@64, Theorem Complete@64.]
Updated 27d ago
Evaluation Results
Method                                   Date     Pass@64  Complete@64  Theorem Complete@64
Ground truth statement (oracle)          2026.03  0.7849   0.2473       0.1828
FormalEvolve (K=2)                       2026.03  0.6828   0.2796       0.2419
Compile+Semantic Repair (Source=Kimina)  2026.03  0.6398   0.2688       0.2473
Sample                                   2026.03  0.5699   0.2473       0.2204
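
The results listed above can be compared per metric with a short script. This is a minimal sketch, not part of the benchmark itself: the scores are copied from the rows above, the helper names (`best_by_metric`, `implied_count`) are hypothetical, and the conversion to problem counts assumes each score is a fraction of the N=186 test problems, which the page does not state explicitly.

```python
# Scores copied from the leaderboard rows, in the order
# (Pass@64, Complete@64, Theorem Complete@64).
RESULTS = {
    "Ground truth statement (oracle)": (0.7849, 0.2473, 0.1828),
    "FormalEvolve (K=2)": (0.6828, 0.2796, 0.2419),
    "Compile+Semantic Repair (Source=Kimina)": (0.6398, 0.2688, 0.2473),
    "Sample": (0.5699, 0.2473, 0.2204),
}
METRICS = ("Pass@64", "Complete@64", "Theorem Complete@64")
N_TEST = 186  # test-set size taken from the benchmark title


def best_by_metric(results, metrics):
    """Return {metric: method with the highest score on that metric}."""
    return {
        metric: max(results, key=lambda method: results[method][i])
        for i, metric in enumerate(metrics)
    }


def implied_count(score, n=N_TEST):
    """Assumption: a score is solved / n, so score * n is close to an integer."""
    return round(score * n)


leaders = best_by_metric(RESULTS, METRICS)
for metric, method in leaders.items():
    score = RESULTS[method][METRICS.index(metric)]
    print(f"{metric}: {method} ({score}, about {implied_count(score)} of {N_TEST})")
```

Under these assumptions, a different method leads on each metric: the oracle on Pass@64, FormalEvolve on Complete@64, and Compile+Semantic Repair on Theorem Complete@64.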