Share your thoughts, 1 month free Claude Pro on usSee more

Theorem Proving on miniF2F Lean (val)

60.2Cumulative Pass Rate

DeepSeekMath-Base

Updated 4mo ago

Evaluation Results

Method	Links
DeepSeekMath-Base 2024.05		60.2	-
Evariste 2022.05		58.6	-
Curriculum Learning 2024.05		58.6	-
Curriculum Learning 2024.05		47.3	-
Curriculum Learning 2024.05		41.2	-
Curriculum Learning 2024.05		33.6	-
Proof Artifact Co-Training 2024.05		29.3	-
GPT-4-turbo 0409 2024.05		25.4	-
DeepSeekMath-Base 2024.05		25.4	-
Proof Artifact Co-Training 2024.05		23.9	-
Supervised 2022.05		-	38.5
GPT-f 2022.05		-	47.3
Evariste-1d 2022.05		-	46.7
Evariste-7d 2022.05		-	47.5