Harmful Meme Detection on Twitter Temporal-Evolving Memes 2025 (Oct~Dec)
[Chart: F1 Score and Accuracy over time. Best F1 Score: 86.7 (REPMD). Date marker: Jan 8, 2026.]
Evaluation Results
Method            Configuration               Date     F1 Score  Accuracy
REPMD             Base MLLM=Doubao-1.5-V...   2026.01  86.7      85
REPMD             Base MLLM=Doubao-1.5-V...   2026.01  86.7      85
REPMD             Base MLLM=GPT-4o, Temp...   2026.01  83.5      80
REPMD             Base MLLM=GPT-4o, Temp...   2026.01  82.4      78
GPT-4o            Mode=Vanilla                2026.01  67.4      65
Doubao-1.5-V-Pro  Mode=Vanilla                2026.01  67.4      65
RA-HMD            Temporal Evaluation Pr...   2026.01  56.4      55.2
RA-HMD            Temporal Evaluation Pr...   2026.01  45.3      44.1