Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Datasets
MMHal
Loading...
Benchmarks
Task Name
Dataset Name
Task Name
Dataset Name
SOTA Result
Trend
Results
Hallucination Evaluation
MMHal
Score
4.2
37
VQA Hallucination
MMHal
Score
3.87
21
Pointwise Scoring
MMHal pointwise
Kendall's Tau
0.949
9
Hallucination Evaluation
MMHal v1.0 (test)
Score
2.23
6
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Feedback
Search any
task
Search any
task