Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Long-document Understanding on MMLongBench-Doc 1.0 (test)

46.7Reports Score

BayesRAG

29.33233.84138.3542.859Jan 12, 2026Jan 20, 2026Jan 29, 2026Feb 7, 2026Feb 15, 2026Feb 24, 2026Mar 5, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
46.742.641.853.632.734.841.646.43739.431.138.641.248.638.844.1
45.340.639.5463234.637.343.536.439.630.533.339.242.737.141.5
44.86--52.2-52.26-52.34-51-62.19-56.14-51.29
2026.03
43.15--48.55-43.21-52.25-46-53.08-50.42-47.21
2026.03
42.8--44.12-43.21-52.34-46-50.61-50.42-46.09
2026.03
41.43--49.27-43.43-52.25-41-54.32-48.72-45.55
41.09--41.3-44.22-45.16-40-54.32-52.13-44.36
2026.03
40.41--47.82-42.42-50.96-42-51.85-46.15-44.86
2026.01
38.936.831.535.227.829.930.635.233.436.628.730.831.234.13234.9
36.729.82938.216.8242432.429.534.641.350.638.34128.235.2
2026.01
36.129.833.339.516.52127.63219.624.721.825.935.336.726.531.4
2026.01
3027.919.420.821.52419.823.719.823.719.724.611.111.121.323.8