Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Informal-to-Formal Proving on miniF2F (test)

24.6Accuracy

DeepSeekMath-Base

17.3219.2121.122.99Feb 5, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.02
24.6
2024.02
22.1
2024.02
21.3
2024.02
18
2024.02
18
2024.02
17.6