Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Document Visual Question Answering on MMLongbench doc

45.6Accuracy

GPT-4.1

12.11220.80629.538.194Sep 2, 2025Oct 11, 2025Nov 19, 2025Dec 28, 2025Feb 5, 2026Mar 16, 2026Apr 24, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
45.649.7---------
2025.09
43.25----------
2025.12
41.642.3---------
2025.09
41.13----------
2026.01
36.55-38.5931.831.4632.2333.05----
2025.12
33.938.4---------
2026.01
33.22-37.3727.1924.7232.8927.97----
2025.12
3331.5---------
2026.01
32.71-35.2328.5726.9730.2327.97----
2026.01
32.25-32.5527.6526.429.930.51----
2026.04
32.233.7-----2232.521.6
2026.01
31.66-33.2224.8826.9729.927.12----
2026.01
31.66-35.9123.9629.2127.9128.81----
2026.01
31.55-31.5424.8830.3430.2320.34----
2026.04
31.533.4-----5580.510.7
2025.09
31.33----------
2025.12
31.330.7---------
2025.09
31.05----------
2026.01
30.97-32.5527.1925.2827.5727.12----
2025.09
30.68----------
2026.01
30.5-30.5426.7327.5327.5725.42----
2026.01
29.92-31.5425.3527.5326.9122.88----
2025.12
29.628.8---------
2025.12
28.823---------
2025.12
28.629.4---------
2026.04
28.530.5-----2228.61.21.8
2026.04
28.228.3-----5.27.28.26.1
2026.01
27.82-27.1822.1221.9127.5725.42----
2026.01
27.59-31.2123.527.5323.5921.19----
2025.12
27.527.2---------
2026.04
27.429.3-----5580.50.60.7
2026.04
27.327.9-----13.716.93.92.9
2026.01
27.12-29.1921.6622.4724.2522.03----
2026.01
25.96-33.8921.221.3518.9421.19----
2026.04
25.326.8-----5535.80.61.4
2026.04
2526.8-----5.27.15.76.1
2025.12
24.924.6---------
2026.04
24.525.8-----13.716.92.22.9
2026.04
24.324.3-----13.715.12.53.2
2026.04
23.224-----13.78.32.55.4
2025.12
2324.2---------
2026.04
2323.2-----5566.20.60.8
2026.04
22.824.4-----5540.60.61.3
2025.12
21.220.7---------
2025.12
2122.6---------
2026.04
20.421.1-----13.710.12.54.5
2025.12
18.818.3---------
2025.12
13.48.9---------