Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mantis

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-image reasoningMantis (test)
Accuracy72.81
39
Adversarial AttackMantis Eval
Attack Success Rate84.57
37
Multi-image ReasoningMantis
Accuracy71
18
Visual Question AnsweringMantis Eval
ASR71.32
12
Multimodal ReasoningMantis-Eval
Accuracy59.23
11
Interleaved Image Multimodal UnderstandingMantis
Score64.2
7
Multi-image Visual Question AnsweringMantis
Accuracy76.5
4
Multi-image in the WildMantis
Accuracy77.6
4
Intent PredictionMANTIS
AP77.1
4
Multi-image Multi-modal UnderstandingMantis
Accuracy65.4
2
Showing 10 of 10 rows