Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HateMM: A Multi-Modal Dataset for Hate Video Classification

About

Hate speech has become one of the most significant issues in modern society, having implications in both the online and the offline world. Due to this, hate speech research has recently gained a lot of traction. However, most of the work has primarily focused on text media with relatively little work on images and even lesser on videos. Thus, early stage automated video moderation techniques are needed to handle the videos that are being uploaded to keep the platform safe and healthy. With a view to detect and remove hateful content from the video sharing platforms, our work focuses on hate video detection using multi-modalities. To this end, we curate ~43 hours of videos from BitChute and manually annotate them as hate or non-hate, along with the frame spans which could explain the labelling decision. To collect the relevant videos we harnessed search keywords from hate lexicons. We observe various cues in images and audio of hateful videos. Further, we build deep learning multi-modal models to classify the hate videos and observe that using all the modalities of the videos improves the overall hate speech detection performance (accuracy=0.798, macro F1-score=0.790) by ~5.7% compared to the best uni-modal model in terms of macro F1 score. In summary, our work takes the first step toward understanding and modeling hateful videos on video hosting platforms such as BitChute.

Mithun Das, Rohit Raj, Punyajoy Saha, Binny Mathew, Manish Gupta, Animesh Mukherjee• 2023

Related benchmarks

TaskDatasetResultRank
Video ClassificationHateMM English (test)
Accuracy79.8
15
Video ClassificationMultiHateClip Chinese (test)
Accuracy68.3
15
Frame-level hate localizationHateMM (test)
Accuracy74.81
13
Hate Video DetectionHMM to MHY (test)
Accuracy64.8
12
Hate Video DetectionHMM to MHB (test)
Accuracy58.3
12
Hate Video DetectionMHY to HMM (test)
Accuracy59.46
12
Hate Video DetectionMHY to MHB (test)
Accuracy56
12
Hate Video DetectionMHB to HMM (test)
Accuracy45.71
12
Hate Video DetectionMHB to MHY (test)
ACC53
12
Frame-level hate localizationMHC (test)
Accuracy0.6861
11
Showing 10 of 11 rows

Other info

Code

Follow for update