Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Harmful Meme Detection on Twitter Temporal-Evolving Memes 2025 (Apr~Jun)
Loading...
82.4
F1 Score
REPMD
29.672
43.361
57.05
70.739
Jan 8, 2026
F1 Score
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Accuracy
REPMD
Base MLLM=Doubao-1.5-V...
2026.01
82.4
80
REPMD
Base MLLM=Doubao-1.5-V...
2026.01
82.4
80
REPMD
Base MLLM=GPT-4o, Temp...
2026.01
67.4
65
Doubao-1.5-V-Pro
Mode=Vanilla
2026.01
67.4
65
REPMD
Base MLLM=GPT-4o, Temp...
2026.01
60.2
60
GPT-4o
Mode=Vanilla
2026.01
51.3
50
RA-HMD
Temporal Evaluation Pr...
2026.01
45.9
44.9
RA-HMD
Temporal Evaluation Pr...
2026.01
31.7
31.2
Feedback
Search any
task
Search any
task