Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-video Question Answering on Long VideoBench (val)

72.6Accuracy

GPT-5

37.7646.80555.8564.895Jan 14, 2025Mar 9, 2025May 3, 2025Jun 27, 2025Aug 21, 2025Oct 15, 2025Dec 9, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
72.6
2025.01
66.7
2025.02
66.7
2025.12
66.7
64
2025.02
64
2025.12
64
63.6
2025.12
62.9
2025.02
62.7
2025.12
62.7
2025.12
62.1
61.9
2025.02
61.6
2025.02
61.3
2025.02
60.5
60
2025.12
60
2025.02
59.3
2025.12
59.3
2025.02
58.9
2025.01
58.6
58.2
2025.12
57.7
2025.12
56
55.6
2025.02
55.5
2025.02
54.9
2025.02
54.8
54.2
2025.02
53.2
2025.12
52.1
2025.12
50.7
2025.12
47.8
2025.01
39.8
2025.02
39.1