Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual Grounded Reasoning on HR-Bench-8K

76.3Overall Score

Qwen2.5-VL-72B

59.1463.59568.0572.505Nov 27, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.11
76.384.368.3
2025.11
7589.560.4
2025.11
73.186.559.8
2025.11
72.686.858.5
2025.11
71.686.556.8
2025.11
68.883.554
2025.11
67.371.862.8
2025.11
6771.362.8
2025.11
66.98054.3
2025.11
6264.359.8
2025.11
60.968.853
2025.11
59.865.354.3