
Real-world perception-centric reasoning on Real-world perception-centric reasoning suite (test)

Average Score: 55.53

GLM-9B-DeltaThinker

[Chart: Average Score, y-axis from 39.826 to 52.057; x-axis label: May 15, 2026]
Updated 16d ago

Evaluation Results

Method    Date       Average Score    Links
          2026.05    55.53
          2026.05    54.12
                     49.75
                     48.67
                     48.06
                     46.92
          2026.05    46.45
          2026.05    40.43