Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nitya Mani

An Interpretable Approach to Hateful Meme Detection

Aug 09, 2021

Tanvi Deshpande, Nitya Mani

Figure 1 for An Interpretable Approach to Hateful Meme Detection

Figure 2 for An Interpretable Approach to Hateful Meme Detection

Figure 3 for An Interpretable Approach to Hateful Meme Detection

Figure 4 for An Interpretable Approach to Hateful Meme Detection

Abstract:Hateful memes are an emerging method of spreading hate on the internet, relying on both images and text to convey a hateful message. We take an interpretable approach to hateful meme detection, using machine learning and simple heuristics to identify the features most important to classifying a meme as hateful. In the process, we build a gradient-boosted decision tree and an LSTM-based model that achieve comparable performance (73.8 validation and 72.7 test auROC) to the gold standard of humans and state-of-the-art transformer models on this challenging task.

* 5 pages. 2021 ACM International Conference on Multimodal Interaction (ICMI)

Via

Access Paper or Ask Questions