MMHal-Bench

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Hallucination Evaluation | MMHal-Bench | MMHal Score: 4.32 | 174
Multi-modal Hallucination Evaluation | MMHal-Bench v1.0 (test) | Overall Score: 2.14 | 12