Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Formal Theorem Proving on miniF2F rw (test)

75Pass@8

Goedel-Prover-V2-8B

46.29653.74861.268.652May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
7578.3--77
2026.05
74.677.5-58.976.4
2026.05
74.478.6--76.7
2026.05
72.976.7-5675.3
2026.05
7175-5873.2
2026.05
70.974.2--72.9
2026.05
70.374.2--72.3
2026.05
70.174.4--72.6
2026.05
68.872.5--71
2026.05
68.573-53.470.8
2026.05
68.271.3-5570
2026.05
66.570.7-52.268.9
2026.05
6467.368.8--
2026.05
62.66667.6--
2026.05
61.764.666.8--
2026.05
59.763.264.8--
2026.05
52.957.759--
2026.05
52.657.959.9--
2026.05
51.756.557.8--
2026.05
50.55657.6--
2026.05
5055.757.4--
2026.05
49.554.756.4--
2026.05
49.155.557.1--
2026.05
47.453.355.2--