Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Modularized Networks for Few-shot Hateful Meme Detection

About

In this paper, we address the challenge of detecting hateful memes in the low-resource setting where only a few labeled examples are available. Our approach leverages the compositionality of Low-rank adaptation (LoRA), a widely used parameter-efficient tuning technique. We commence by fine-tuning large language models (LLMs) with LoRA on selected tasks pertinent to hateful meme detection, thereby generating a suite of LoRA modules. These modules are capable of essential reasoning skills for hateful meme detection. We then use the few available annotated samples to train a module composer, which assigns weights to the LoRA modules based on their relevance. The model's learnable parameters are directly proportional to the number of LoRA modules. This modularized network, underpinned by LLMs and augmented with LoRA modules, exhibits enhanced generalization in the context of hateful meme detection. Our evaluation spans three datasets designed for hateful meme detection in a few-shot learning context. The proposed method demonstrates superior performance to traditional in-context learning, which is also more computationally intensive during inference.We then use the few available annotated samples to train a module composer, which assigns weights to the LoRA modules based on their relevance. The model's learnable parameters are directly proportional to the number of LoRA modules. This modularized network, underpinned by LLMs and augmented with LoRA modules, exhibits enhanced generalization in the context of hateful meme detection. Our evaluation spans three datasets designed for hateful meme detection in a few-shot learning context. The proposed method demonstrates superior performance to traditional in-context learning, which is also more computationally intensive during inference.

Rui Cao, Roy Ka-Wei Lee, Jing Jiang• 2024

Related benchmarks

TaskDatasetResultRank
Meme ClassificationHatefulMemes
AUC64.5
43
Meme ClassificationHarMeme
Accuracy69.5
30
Meme ClassificationMAMI
Accuracy0.61
30
Harmful Meme DetectionFHM
Accuracy57.6
29
Harmful Meme DetectionMAMI
Accuracy69.05
19
Harmful Meme DetectionToxiCN
Accuracy61.24
16
ClassificationMAMI
Accuracy69.05
16
ClassificationFHM
Accuracy57.6
16
Harmful Meme DetectionGOAT-Bench In-Domain
Racism F158.3
11
Harmful Meme DetectionGOAT-Bench (Out-Of-Domain)
Racism F136.6
7
Showing 10 of 10 rows

Other info

Follow for update