Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

High-Resolution Multimodal Reasoning on HR-Bench 4K

88.3Overall Score

TTSP

62.92469.51276.182.688Mar 29, 2026Mar 31, 2026Apr 3, 2026Apr 5, 2026Apr 8, 2026Apr 10, 2026Apr 13, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.04
88.396.879.8
2026.04
86.49676.8
2026.04
86.495.877
2026.04
86.395.876.8
2026.04
86.295.377
2026.04
85.897.374.3
2026.04
85.795.575.8
2026.04
85.695.375.8
2026.04
85.493.877
2026.04
84.395.573
2026.04
83.995.372.5
2026.04
83.995.372.5
2026.04
83.79572.3
2026.04
83.79572.3
2026.04
83.694.872.3
2026.04
82.79372.3
2026.04
819369
2026.04
809070
2026.03
77.5--
2026.04
779163
2026.03
76.492.760
2026.03
75.386.863.7
2026.03
75.191.359
2026.04
75.191.359
2026.03
73.685.861.5
2026.03
7392.253.7
2026.03
72.990.555.2
72.98660.3
2026.03
72.683.561.8
2026.03
71.383.858.8
2026.03
70.78259.3
2026.03
7078.861.3
2026.03
69.278.859.8
2026.03
698454
2026.03
6880.355.8
2026.03
6675.556.5
2026.03
65.869.861.8
2026.03
6566.863.3
2026.04
6566.863.3
2026.03
63.97354.7