Multimodal Misinformation Detection on MMFakeBench (val)
[Chart: F1 Score on MMFakeBench (val). Top result: MERIT, 81.65, Oct 20, 2025. Metrics available: F1 Score, Accuracy, Precision, Sensitivity, Specificity.]
Updated 1mo ago
Evaluation Results
Method            Model        Date     F1 Score  Accuracy  Precision  Sensitivity  Specificity
MERIT             GPT-4o-mini  2025.10  81.65     75.1      84.3       79.14        65.67
MMD-Agent         GPT-4o-mini  2025.10  78.8      72.5      85.59      73           71.33
GPT-4V MMD-Agent  GPT-4V       2025.10  74        76.8      73.4       75.5         -
GPT-4V Standard   GPT-4V       2025.10  72.3      75.6      72.1       72.8         -