
ProofWriter

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Logical Reasoning | ProofWriter (test) | Accuracy 92.32 | 36 |
| Logical Reasoning | ProofWriter | Accuracy 98.4 | 32 |
| Logical Reasoning | ProofWriter | Accuracy 99.7 | 24 |
| Deductive Reasoning | ProofWriter | Pass@1 97.4 | 18 |
| Explanation Refinement | ProofWriter | Initial Score 92 | 15 |
| Reasoning | ProofWriter | Accuracy 65 | 14 |
| Logical Reasoning | ProofWriter (held-out) | Performance 0.5483 | 14 |
| Deductive logical reasoning | ProofWriter (test) | ExcRate 100 | 12 |
| Deductive Reasoning | ProofWriter | Calibrated Accuracy 92.1 | 8 |
| Deductive logical reasoning | ProofWriter 600 records (test) | Exc. Rate - | 0 |