Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MuirBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-image ReasoningMuirBench
Accuracy77.2
48
Multi-image UnderstandingMuirBench
Score68
26
Multi-image reasoningMuirbench (test)
Accuracy68
24
Multi-Image UnderstandingMuirBench (test)
Accuracy68
21
Multi-Image UnderstandingMuirBench 142 (test)
Score86.1
19
Multi-image UnderstandingMuirBench Multi-image Understanding
Accuracy62.3
17
Multimodal ReasoningMuirBench
Accuracy57.14
11
Procedural Temporal UnderstandingMuirBench (test)
Overall Score65.04
7
General Visual Question AnsweringMuirBench
Score70.7
5
Comprehensive Multi-imageMuirBench
Accuracy62.3
4
Multi-image Multi-modal UnderstandingMuirBench
Accuracy41.8
2
Showing 11 of 11 rows