Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection via Design Concept Reproduction

About

Harmful memes are ever-shifting in the Internet communities, which are difficult to analyze due to their type-shifting and temporal-evolving nature. Although these memes are shifting, we find that different memes may share invariant principles, i.e., the underlying design concept of malicious users, which can help us analyze why these memes are harmful. In this paper, we propose RepMD, an ever-shifting harmful meme detection method based on the design concept reproduction. We first refer to the attack tree to define the Design Concept Graph (DCG), which describes steps that people may take to design a harmful meme. Then, we derive the DCG from historical memes with design step reproduction and graph pruning. Finally, we use DCG to guide the Multimodal Large Language Model (MLLM) to detect harmful memes. The evaluation results show that RepMD achieves the highest accuracy with 81.1% and has slight accuracy decreases when generalized to type-shifting and temporal-evolving memes. Human evaluation shows that RepMD can improve the efficiency of human discovery on harmful memes, with 15$\sim$30 seconds per meme.

Ziyou Jiang, Mingyang Li, Junjie Wang, Yuekai Huang, Jie Huang, Zhiyuan Chang, Zhaoyang Li, Qing Wang• 2026

Related benchmarks

TaskDatasetResultRank
Harmful Meme DetectionGOAT-Bench In-Domain
Racism F188.5
11
Harmful Meme DetectionTwitter Temporal-Evolving Memes 2025 (Apr~Jun)
F1 Score82.4
8
Harmful Meme DetectionTwitter Temporal-Evolving Memes 2025 (Jul~Sep)
F1 Score82.4
8
Harmful Meme DetectionTwitter Temporal-Evolving Memes 2025 (Oct~Dec)
F1 Score86.7
8
Harmful Meme DetectionGOAT-Bench (Out-Of-Domain)
Racism F187.1
7
Safeguarding harmful image generationHarmful Meme Racism
Average SSIM0.12
2
Safeguarding harmful image generationHarmful Meme Misogyny
Average SSIM0.33
2
Safeguarding harmful image generationHarmful Meme Offensiveness
Average SSIM0.19
2
Safeguarding harmful image generationHarmful Meme Sarcasm--
1
Safeguarding harmful image generationHarmful Meme Toxicity--
1
Showing 10 of 10 rows

Other info

Follow for update