Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FINER-DOCCI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hallucination EvaluationFINER-DOCCI 3K MCQs per setting 1.0
Multi-Object Paired Accuracy65.9
16
Showing 1 of 1 rows