Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Human Evaluation on MathQA

89.2Accuracy

Ours

57.89666.02374.1582.277Feb 18, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.02
89.20.60.460.6
2025.02
63.50.230.260.22
2025.02
59.10.170.280.18