Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMIU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-image UnderstandingMMIU
Accuracy55.8
65
Visual Question AnsweringMMIU
Accuracy71
19
Multi-Image UnderstandingMMIU 106 (test)
Score72.1
19
Narrative ReasoningMMIU (test)
BLEURT Score0.306
14
Multi-image UnderstandingMMIU (test)
Accuracy52.6
11
Image UnderstandingMMIU
MMIU Score40.2
7
Visual Quality AssessmentMMIU visual quality
Accuracy53
3
Text-to-Image RetrievalMMIU text2image_retrieval
Accuracy25.2
3
Emotion RecognitionMMIU emotion_findingemo
Accuracy26.9
3
Emotion RecognitionMMIU emotion_expw
Accuracy31.8
3
Forensic DetectionMMIU forensic_blink
Accuracy30.9
3
Forensic DetectionMMIU forensic_forgerynet
Accuracy87.4
3
Showing 12 of 12 rows