Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Daily-Omni

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-visual understandingDaily-Omni
Accuracy82.8
58
Audio-Visual DialogueDaily-Omni
Score71.9
32
Video UnderstandingDaily-Omni
Daily Score57.7
20
Audiovisual Understanding & ReasoningDaily-Omni
Score77.9
15
Omnimodal common event understandingDaily-Omni
Accuracy81.4
13
QA performance by Gemini-2.5-Pro based on captionsDaily-Omni (test)
Daily-Omni QA Score61.2
13
Video Question AnsweringDaily-Omni
Score60.2
11
Audio-Visual Question AnsweringDaily-Omni 1 FPS
Metric 3070.9
8
Audio-Visual Question AnsweringDaily-Omni
Score73.6
8
Audio-Visual PerceptionDaily-Omni
Score60.65
8
Omni-modal collaborative reasoningDaily-Omni
Top-1 Accuracy71.09
6
Showing 11 of 11 rows