Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MIRAGE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Coarse-level Multimodal Misinformation DetectionMiRAGe News
Accuracy80.2
14
Medical Question AnsweringMIRAGE (test)
MMLU-Med89.44
12
Biomedical Retrieval-Augmented GenerationMirage
MMLU-med Accuracy87.24
10
Flicker-banding and Moire RemovalMIRAGE cropped (test)
SSIM0.7354
9
Multi-modal Forgery DetectionMiRAGe
Accuracy53.92
5
Binary forgery detectionMiRAGe
Accuracy56.99
5
Data source relevance classificationMIRAGE (test)
Accuracy86.63
1
Showing 7 of 7 rows