Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math Evaluation on Math ID

97.26Accuracy

GPT-5.2-chat-latest

88.659290.892193.12595.3579May 20, 2026
Updated 13d ago

Evaluation Results

MethodLinks
97.2680.53
96.9475.22
2026.05
94.9368.14
2026.05
93.8164.6
2026.05
93.6463.72
2026.05
93.5665.49
2026.05
93.0260.71
92.8461.06
2026.05
92.7758.94
2026.05
92.2858.41
90.0233.63
89.6237.17
88.9948.67