Harmful Meme Detection on Twitter Temporal-Evolving Memes 2025 (Jul~Sep)
[Chart: results over time; metrics: F1 Score, Accuracy. Highlighted: REPMD, 82.4 F1 Score, Jan 8, 2026.]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | F1 Score | Accuracy |
|---|---|---|---|---|
| REPMD | Base MLLM=Doubao-1.5-V... | 2026.01 | 82.4 | 80 |
| REPMD | Base MLLM=Doubao-1.5-V... | 2026.01 | 82.4 | 80 |
| Doubao-1.5-V-Pro | Mode=Vanilla | 2026.01 | 66.2 | 65 |
| REPMD | Base MLLM=GPT-4o, Temp... | 2026.01 | 63 | 66 |
| REPMD | Base MLLM=GPT-4o, Temp... | 2026.01 | 61.6 | 65 |
| GPT-4o | Mode=Vanilla | 2026.01 | 56.3 | 55 |
| RA-HMD | Temporal Evaluation Pr... | 2026.01 | 52.7 | 51.5 |
| RA-HMD | Temporal Evaluation Pr... | 2026.01 | 40.2 | 40 |