
Holistic Evaluation on Combined Suite General Reasoning Perception Text

76.3 Text Average

Qwen3-32B

May 14, 2026
Updated 16d ago

Evaluation Results

Method  Links
2026.05   76.3   -      -      -      -      -
2026.05   67.9   -      -      -      -      -
2026.05   58.1   60.9   50.1   77.9   61.1   60.3
2026.05   53.1   57.6   48.3   73.4   58.1   56.7
2026.05   52.6   52.5   41     72.4   53.1   53
2026.05   45.1   51.1   40.1   68.3   51.2   49.6
2026.05   26.7   53.3   32.2   77.8   51.5   44.7
2026.05   26.3   45.9   36.3   67.8   47.8   41.9
2026.05   25.1   44.8   29.8   64.3   44.1   38.9
2026.05   21     40.8   27.4   60.5   40.7   35.3
2026.05   19.3   34.9   23.7   36.5   31.1   27.9
2026.05   19.1   42.4   27.4   58.4   40.8   34.9
2026.05   2.8    51.8   36.7   67.4   50     37.1