Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math on Math OOD

92.04Accuracy

GPT-5.2-chat-latest

78.686482.153285.6289.0868May 20, 2026
Updated 13d ago

Evaluation Results

MethodLinks
92.0488.5
90.2771.68
2026.05
88.562.83
2026.05
88.4961.06
2026.05
87.6154.87
2026.05
86.2851.33
84.5146.02
2026.05
84.0745.13
83.6343.36
2026.05
83.1942.48
2026.05
82.340.71
81.8625.66
79.220.35