Title: WWW: Where, Which and Whatever Enhancing Interpretability in Multimodal Deepfake Detection

Aug 06, 2024

Juho Jung, Sangyoun Lee, Jooeon Kang, Yunjin Na

Abstract: All current benchmarks for multimodal deepfake detection manipulate entire frames using various generation techniques, resulting in oversaturated detection accuracies exceeding 94% in video-level classification. However, these benchmarks struggle to detect dynamic deepfake attacks with the challenging frame-by-frame alterations that arise in real-world scenarios. To address this limitation, we introduce FakeMix, a novel clip-level evaluation benchmark aimed at identifying manipulated segments within both video and audio, providing insight into the origins of deepfakes. Furthermore, we propose novel evaluation metrics, Temporal Accuracy (TA) and Frame-wise Discrimination Metric (FDM), to assess the robustness of deepfake detection models. Evaluating state-of-the-art models against diverse deepfake benchmarks, particularly FakeMix, comprehensively demonstrates the effectiveness of our approach. Specifically, while existing models achieve an Average Precision (AP) of 94.2% at the video level, evaluating them at the clip level with the proposed metrics, TA and FDM, yields sharp declines in accuracy to 53.1% and 52.1%, respectively.

* 4 pages, 2 figures, 2 tables, Accepted as Oral Presentation at The Trustworthy AI Workshop @ IJCAI 2024
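
The gap the abstract describes between video-level and clip-level scores can be illustrated with a toy example. The sketch below is only an illustration under stated assumptions: it assumes each video is split into clips with a binary label (1 = manipulated, 0 = real), and the helper names `video_level_label` and `clip_level_accuracy` are hypothetical. It does not reproduce the paper's definitions of TA or FDM, which the abstract does not spell out.

```python
from typing import List


def video_level_label(clip_labels: List[int]) -> int:
    """Assumption: a video counts as fake (1) if any of its clips is fake."""
    return int(any(clip_labels))


def clip_level_accuracy(pred: List[int], truth: List[int]) -> float:
    """Fraction of clips whose predicted label matches the ground truth.

    A plain per-clip accuracy, used here only for illustration;
    it is not the paper's TA or FDM metric.
    """
    if not truth or len(pred) != len(truth):
        raise ValueError("pred and truth must be non-empty and equal in length")
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


# Hypothetical video with six clips, two of which are manipulated.
truth = [0, 1, 0, 0, 1, 0]
# Hypothetical detector that flags every clip as fake.
pred = [1, 1, 1, 1, 1, 1]

video_correct = video_level_label(pred) == video_level_label(truth)
clip_acc = clip_level_accuracy(pred, truth)

print(f"video-level decision correct: {video_correct}")
print(f"clip-level accuracy: {clip_acc:.3f}")
```

In this toy case the detector gets the video-level decision right while labeling only two of six clips correctly, which is the kind of discrepancy a clip-level benchmark is meant to expose.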
