Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Theorem Proving on PutnamBench (test)

72Accuracy

Hilbert

-2.8816.563655.44Oct 14, 2025
Updated 9d ago

Evaluation Results

MethodLinks
2025.10
72462
2025.10
51329
2025.10
1492
2025.10
1386
2025.10
747
2025.10
423
2025.10
214
2025.10
210
2025.10
18
2025.10
17
2025.10
0.53
2025.10
0.21
2025.10
00