
Theorem Proving on miniF2F Lean (val)

Cumulative Pass Rate: 60.2

DeepSeekMath-Base

[Chart: Cumulative Pass Rate over time, May 23, 2022 to May 23, 2024]
Updated 4d ago

Evaluation Results

Date      Cumulative Pass Rate
2024.05   60.2                   -
2022.05   58.6                   -
2024.05   58.6                   -
2024.05   47.3                   -
2024.05   41.2                   -
2024.05   33.6                   -
2024.05   29.3                   -
2024.05   25.4                   -
2024.05   25.4                   -
2024.05   23.9                   -
2022.05   -                      38.5
2022.05   -                      47.3
2022.05   -                      46.7
2022.05   -                      47.5