Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VidHalluc

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video UnderstandingVidHalluc
Accuracy83.84
18
Hallucination examinationVidHalluc
BQA78.08
15
Video ReasoningVidHalluc (test)
Binary QA Accuracy (ACH)81.15
13
Hallucination DetectionVidHalluc 19
MC Accuracy91
4
Showing 4 of 4 rows