Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PutnamBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Formal Theorem ProvingPutnamBench
Solved Count668
42
Theorem ProvingPutnamBench Lean
Solved Rate668
23
Theorem ProvingPutnamBench (test)
Accuracy72
13
Theorem ProvingPutnamBench Number Theory
Solved Problems19
13
Formal Theorem ProvingPutnamBench September 2025
Solved Problems Count462
11
Theorem ProvingPutnamBench
Average Proof Length62.5
9
Mathematical formalizationPutnamBench 672 problems
C@163
8
Formal Mathematical Answer-ConstructionPutnamBench
Solved Instances17
7
AutoformalizationPutnamBench (PB)
Mean Cycle Consistency0.561
6
Automated Theorem ProvingPutnamBench Easy Mode
Solved Problems (Pass@32)43
3
Automated Theorem ProvingPutnamBench Hard Mode
Total Solved (Pass@32)36
2
Showing 11 of 11 rows