Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Document Visual Question Answering on MMLongbench doc

45.6Accuracy

GPT-4.1

12.11220.80629.538.194Dec 14, 2025Dec 21, 2025Dec 29, 2025Jan 6, 2026Jan 13, 2026Jan 21, 2026Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
45.649.7-----
2025.12
41.642.3-----
2026.01
36.55-38.5931.831.4632.2333.05
2025.12
33.938.4-----
2026.01
33.22-37.3727.1924.7232.8927.97
2025.12
3331.5-----
2026.01
32.71-35.2328.5726.9730.2327.97
2026.01
32.25-32.5527.6526.429.930.51
2026.01
31.66-33.2224.8826.9729.927.12
2026.01
31.66-35.9123.9629.2127.9128.81
2026.01
31.55-31.5424.8830.3430.2320.34
2025.12
31.330.7-----
2026.01
30.97-32.5527.1925.2827.5727.12
2026.01
30.5-30.5426.7327.5327.5725.42
2026.01
29.92-31.5425.3527.5326.9122.88
2025.12
29.628.8-----
2025.12
28.823-----
2025.12
28.629.4-----
2026.01
27.82-27.1822.1221.9127.5725.42
2026.01
27.59-31.2123.527.5323.5921.19
2025.12
27.527.2-----
2026.01
27.12-29.1921.6622.4724.2522.03
2026.01
25.96-33.8921.221.3518.9421.19
2025.12
24.924.6-----
2025.12
2324.2-----
2025.12
21.220.7-----
2025.12
2122.6-----
2025.12
18.818.3-----
2025.12
13.48.9-----