Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Jul 28, 2023

Daniele Mari, Davide Salvi, Paolo Bestagini, Simone Milani

Figure 1 for All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Figure 2 for All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Figure 3 for All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Figure 4 for All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Share this with someone who'll enjoy it:

Abstract:Recent advances in deep learning and computer vision have made the synthesis and counterfeiting of multimedia content more accessible than ever, leading to possible threats and dangers from malicious users. In the audio field, we are witnessing the growth of speech deepfake generation techniques, which solicit the development of synthetic speech detection algorithms to counter possible mischievous uses such as frauds or identity thefts. In this paper, we consider three different feature sets proposed in the literature for the synthetic speech detection task and present a model that fuses them, achieving overall better performances with respect to the state-of-the-art solutions. The system was tested on different scenarios and datasets to prove its robustness to anti-forensic attacks and its generalization capabilities.

* Accepted at ECML-PKDD 2023 Workshop "Deep Learning and Multimedia Forensics. Combating fake media and misinformation"

View paper on

Share this with someone who'll enjoy it:

Title:All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Paper and Code