Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Theorem Proving on MiniF2F (val)

63.9Success Rate

InternLM2-StepProver

7.251221.958136.66551.3719May 22, 2022Nov 16, 2022May 14, 2023Nov 8, 2023May 5, 2024Oct 30, 2024Apr 27, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2024.08
63.9---
2024.08
61.9---
2024.08
61.9---
2024.08
58.6---
2024.08
55.3---
2024.08
55.3---
2024.08
55.3---
2024.08
55.3---
2024.08
48---
2024.08
48---
2024.08
42.6---
2024.08
42.6---
2022.05
37.3---
2024.08
37.3---
2024.08
37.3---
2022.05
36.1---
2025.04
34.0220.6412
2025.04
34.0220.6412.68
33.6---
2022.05
33.6---
2025.04
33.21.890.6513.89
2025.04
31.563.11--
2025.04
31.563.11--
2025.04
31.152.89--
2025.04
29.512.670.949.68
2022.05
28.3---
28.3---
2024.08
28.3---
2024.08
28.3---
27.1---
2025.04
27.052.83--
2025.04
26.642.381.1216.33
2025.04
25.411.92127.66
25---
2025.04
24.591.890.9743.18
2022.05
23.9---
2022.05
23.9---
2025.04
23.77---
2025.04
21.722.12--
2025.04
21.131.730.7437.5
2025.04
20.49---
2025.04
20.08---
2025.04
20.081.92--
2025.04
18.441.95--
2025.04
18.032.33--
2024.08
18---
2024.08
18---
2025.04
16.8---
2025.04
15.16---
2025.04
15.16---
2025.04
14.75---
2025.04
14.75---
2025.04
13.52---
2025.04
12.7---
2025.04
12.3---
9.9---
2024.08
9.9---
2024.08
9.9---
2025.04
9.43---