Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Jul 14, 2023

Libo Qin, Shijue Huang, Qiguang Chen, Chenran Cai, Yudi Zhang, Bin Liang, Wanxiang Che, Ruifeng Xu

Figure 1 for MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Figure 2 for MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Figure 3 for MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Figure 4 for MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Share this with someone who'll enjoy it:

Abstract:Multi-modal sarcasm detection has attracted much recent attention. Nevertheless, the existing benchmark (MMSD) has some shortcomings that hinder the development of reliable multi-modal sarcasm detection system: (1) There are some spurious cues in MMSD, leading to the model bias learning; (2) The negative samples in MMSD are not always reasonable. To solve the aforementioned issues, we introduce MMSD2.0, a correction dataset that fixes the shortcomings of MMSD, by removing the spurious cues and re-annotating the unreasonable samples. Meanwhile, we present a novel framework called multi-view CLIP that is capable of leveraging multi-grained cues from multiple perspectives (i.e., text, image, and text-image interaction view) for multi-modal sarcasm detection. Extensive experiments show that MMSD2.0 is a valuable benchmark for building reliable multi-modal sarcasm detection systems and multi-view CLIP can significantly outperform the previous best baselines.

* Accepted by ACL2023 Findings

View paper on

Share this with someone who'll enjoy it:

Title:MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Paper and Code