Share your thoughts, 1 month free Claude Pro on usSee more

Informal-to-Formal Proving on miniF2F (test)

24.6Accuracy

DeepSeekMath-Base

Updated 5mo ago

Evaluation Results

Method	Links
DeepSeekMath-Base 2024.02		24.6
Llemma 2024.02		22.1
Llemma 2024.02		21.3
Mistral 2024.02		18
CodeLlama 2024.02		18
CodeLlama 2024.02		17.6