
Informal-to-formal proving on miniF2F (val)

25.8 Proven Theorems Rate

DeepSeekMath-Base

Updated 4mo ago

Evaluation Results

Method	Date	Proven Theorems Rate
DeepSeekMath-Base	2024.02	25.8
LLEMMA-34b	2023.10	21.03
Llemma	2024.02	21
LLEMMA-7b	2023.10	20.6
Llemma	2024.02	20.6
Mistral	2024.02	18.9
CodeLlama	2024.02	18.5
Code Llama 34b	2023.10	18.45
Code Llama 7b	2023.10	16.31
CodeLlama	2024.02	16.3
Sledgehammer	2023.10	14.72