Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-document Visual Question Answering on MMLongBench 128K context

83.33MMLB-D

GPT-5.5

14.253232.186650.1268.0534May 13, 2026
Updated 20d ago

Evaluation Results

MethodLinks
2026.05
83.3389.12-86.22
2026.05
77.7580.579383.77
2026.05
69.6376.389078.67
2026.05
65.0467.69174.55
2026.05
52.9673.06-63.01
2026.05
51.87628666.62
2026.05
40.1962.4573.958.85
2026.05
35.6843.256046.31
2026.05
34.1956.337755.84
2026.05
32.0860.157656.08
2026.05
30.5651.916047.49
2026.05
29.6550.335444.66
2026.05
29.6150.215344.27
2026.05
28.537.784537.09
2026.05
27.9461.136852.35
2026.05
26.9651.856848.94
2026.05
26.4433.385337.61
2026.05
25.8948.445844.11
2026.05
22.2440.091.0121.11
2026.05
20.8327.735133.19
2026.05
18.2831.89016.72
2026.05
16.9133.86016.92