Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMDU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-turn visual dialogueMMDU 45K
Accuracy3.79
18
Multi-image Dialogue UnderstandingMMDU
Accuracy26.37
12
Multimodal dialogue understandingMMDU
GPT-4o Score0.703
10
Multi-turn Multi-image DialogMMDU
Accuracy66.3
4
Showing 4 of 4 rows