Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-document Visual Question Answering on MMLongBench Overall

90.77Average Score

GPT-5.5

25.031642.098359.16576.2317May 13, 2026
Updated 20d ago

Evaluation Results

MethodLinks
2026.05
90.77
2026.05
83.66
2026.05
80.13
2026.05
74.37
2026.05
69.41
2026.05
67.1
2026.05
60.83
2026.05
58.31
2026.05
57.7
2026.05
52.63
2026.05
50.59
2026.05
48.88
2026.05
47.76
2026.05
47.74
2026.05
47.47
2026.05
47.15
2026.05
38.03
2026.05
37.08
2026.05
34.24
2026.05
33.81
2026.05
30.23
2026.05
27.56