Video Question Answering on Video-MME v1.0 (test)

65.3 Accuracy (Long)

GPT-4o

[Chart: Accuracy (Long) over time, May 27, 2025 to May 21, 2026]
Updated 12d ago

Evaluation Results

Method  Links
2026.05  65.3  -     70.3  71.9
2026.05  60.6  -     59.5  -
2025.05  53.6  71.9  63.7  63.1
2025.05  52.6  71.8  62.7  62.4
2026.05  52.1  -     57.9  61.2
2026.05  51    -     61.8  63.6
2026.05  50.1  -     56.6  59.4
2026.05  50    -     61    60.8
2026.05  49.8  -     60.8  62.6
2026.05  49.7  -     53.7  57.2
2026.05  49.5  -     60.2  62.5
2026.05  49.1  -     60.1  59.5
2026.05  48.8  -     56.4  57.3
2026.05  48.5  -     41.4  49.1
2026.05  48.5  -     59    61.5
2026.05  47.9  -     40.1  48.2
2026.05  47.2  -     58    60.3
2026.05  46.2  -     50.4  52.6
2026.05  46.2  -     54.7  56.9
2026.05  46.2  -     39.7  45.4
2026.05  33.4  -     -     38.2
2026.05  -     -     -     59.3