
Hallusion

Benchmarks

Task Name                   Dataset Name   SOTA Result       Trend
Multimodal Reasoning        Hallusion      Score: 63.51      13
Hallucination Mitigation    Hallusion      Accuracy: 64.56   10
Multimodal Comprehension    Hallusion      Score: 50.9       8