Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on OpenR1-Math-220k (unseen)

46Accuracy

LMNet

16.8824.443239.56May 19, 2025
Updated 20d ago

Evaluation Results

MethodLinks
2025.05
46
2025.05
34.7
2025.05
29
2025.05
29
2025.05
23.2
2025.05
18