Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-context document understanding on MMLongBench-Doc
Loading...
42.9
Accuracy
GPT-4o
4.94
14.795
24.65
34.505
Oct 8, 2024
Accuracy
F1 Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
F1 Score
GPT-4o
category=Proprietary
2024.10
42.9
44.9
Qwen2-VL-72B
activated_params=72B,...
2024.10
33.3
35.7
GPT-4o mini
category=Proprietary
2024.10
29
28.6
ARIA
activated_params=3.9B...
2024.10
28.3
24.6
Gemini-1.5-Pro
category=Proprietary
2024.10
28.2
20.6
Gemini-1.5-Flash
category=Proprietary
2024.10
27
21.3
Qwen2-VL-7B
activated_params=7B, c...
2024.10
21.3
22.7
InternVL2-40B
activated_params=40B,...
2024.10
18.2
17.9
InternVL-Chat-V1.5
activated_params=26B,...
2024.10
14.6
13
Llama3.2-11B
activated_params=11B,...
2024.10
13.8
11.3
MiniCPM-V-2.6
activated_params=8B, c...
2024.10
11.5
11.6
Idefics2
activated_params=8B, c...
2024.10
7
6.8
Pixtral-12B
activated_params=12B,...
2024.10
6.4
6
Feedback
Search any
task
Search any
task