Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ProofNet

Benchmarks

Task NameDataset NameSOTA ResultTrend
AutoformalizationProofNet
Compilation Pass Rate@1094.1
28
Formal Theorem ProvingProofNet
Accuracy24.26
26
Step-level correctness assessmentProofNet (test)
PR-AUC32.9
22
Step-level reasoning verificationProofNet
PR-AUC68.2
19
Theorem ProvingProofNet (test)
Pass@3247.3
15
Auto-formalizationProofNet (test)
Pass@897.9
13
AutoformalizationProofNet (test)
πFV44.09
12
Formal Theorem ProvingProofNet (test)
Pass@144.62
12
Mathematical ReasoningProofNet
Accuracy97.2
11
Statement generationProofNet N = 186 (test)
CH@10098.4
11
Theorem ProvingProofNet (val)
Accuracy25.4
11
Lean theorem provingPROOFNET (186 problems)
Pass@824.73
9
Theorem ProvingProofNet (all)
Accuracy25.3
7
Mathematical ReasoningProofNet (test)
Accuracy95.6
6
Formal Theorem ProvingProofNet (val)
Pass Rate9.04
6
Formal ReasoningProofNet
ASR90.7
4
Autoformalization and ProvingProofNet N=186 (test)
Pass@640.7849
4
Mathematical ReasoningProofNet
PPL27.9
3
Theorem AutoformalizationProofNet
Objects3.67
1
Showing 19 of 19 rows