Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Daily-Omni

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-visual understandingDaily-Omni
Accuracy82.8
27
Audiovisual Understanding & ReasoningDaily-Omni
Score77.9
15
QA performance by Gemini-2.5-Pro based on captionsDaily-Omni (test)
Daily-Omni QA Score61.2
13
Video Question AnsweringDaily-Omni
Score60.2
11
Audio-Visual Question AnsweringDaily-Omni
Score73.6
8
Audio-Visual PerceptionDaily-Omni
Score60.65
8
Omni-modal collaborative reasoningDaily-Omni
Top-1 Accuracy71.09
6
Showing 7 of 7 rows