
Long-document Visual Question Answering on MMLongBench 64K context

93.1 MMLB-D

GPT-5.5

May 13, 2026
Updated 20d ago

Evaluation Results

Method    Links
2026.05   93.1    92.52   95.77   93.8
2026.05   79.4    78.25   93      83.55
2026.05   74.43   79.32   91      81.58
2026.05   65.4    68.13   89      74.18
2026.05   61.56   72.37   87.1    73.68
2026.05   56.33   62.42   84      67.58
2026.05   51.97   58.96   77.5    62.81
2026.05   44.06   60.58   77      60.55
2026.05   40.67   42.82   64      49.16
2026.05   39.17   48.37   52      46.51
2026.05   38      51.72   69      52.91
2026.05   36      62.69   80      59.56
2026.05   36      30.21   45      37.07
2026.05   36      49.32   74      53.11
2026.05   35.33   49.69   67      50.67
2026.05   32.17   49.57   75      52.24
2026.05   31.52   49.59   63      48.03
2026.05   29.26   47.94   38      38.4
2026.05   29.04   48.57   53      43.54
2026.05   28      29.86   48      35.29
2026.05   27.72   52.84   70      50.19
2026.05   25      33.33   57      38.44
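
Each result row carries four scores. The page does not label them here, but the numbers suggest that the fourth score is the arithmetic mean of the first three, to within rounding. This is an inference from the values, not something the page states. A minimal Python sketch of that check, using three rows from the results (the names `rows` and `mean_matches` and the 0.01 tolerance are illustrative assumptions):

```python
# Three rows from the results: (score 1, score 2, score 3, reported fourth score).
# The column meanings are not given by the page; treating the fourth value
# as an average of the first three is an assumption being tested here.
rows = [
    (93.1, 92.52, 95.77, 93.8),
    (79.4, 78.25, 93.0, 83.55),
    (25.0, 33.33, 57.0, 38.44),
]


def mean_matches(row, tol=0.01):
    """Return True if the last value equals the mean of the others within tol."""
    *scores, reported = row
    return abs(sum(scores) / len(scores) - reported) <= tol


for row in rows:
    print(row, mean_matches(row))
```

The tolerance allows for the scores having been rounded to two decimals before display; a stricter check would need the unrounded values, which the page does not show.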