
Real-world perception-centric reasoning on V* (test)

Accuracy: 84.25

GLM-9B-DeltaThinker

May 15, 2026
Updated 16d ago

Evaluation Results

Method  Links
2026.05
84.25
2026.05
82.73
81.15
79.58
76.96
2026.05
71.2
69.11
2026.05
58.12