Synthesized speech is common today due to the prevalence of virtual assistants, easy-to-use tools for generating and modifying speech signals, and remote work practices. Synthesized speech can also be used for nefarious purposes, including fabricating a speech signal and attributing it to a person who never spoke its content. We therefore need methods to detect whether a speech signal is synthesized. In this paper, we analyze speech signals in the form of spectrograms with a Compact Convolutional Transformer (CCT) for synthesized speech detection. The CCT uses a convolutional layer that introduces inductive biases and shared weights into the network, allowing a transformer architecture to perform well with fewer training samples. It also uses an attention mechanism to incorporate information from all parts of the signal under analysis. We train the CCT on both genuine and synthesized human voice signals and demonstrate that it successfully differentiates between the two.
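To make the architecture described above concrete, the following is a minimal PyTorch sketch of a CCT-style classifier for spectrogram inputs: a convolutional tokenizer (shared weights, local inductive bias) replaces the usual patch embedding, a transformer encoder attends over all resulting tokens, and an attention-based sequence pooling feeds a genuine-vs-synthesized classifier. The layer sizes, input shape, and class/parameter names (`CompactConvTransformer`, `n_mels`, `embed_dim`, etc.) are illustrative assumptions, not the configuration used in the paper, and positional embeddings are omitted for brevity.

```python
import torch
import torch.nn as nn


class CompactConvTransformer(nn.Module):
    """Illustrative CCT-style classifier for spectrograms (not the paper's exact model)."""

    def __init__(self, embed_dim=128, depth=4, heads=4, num_classes=2):
        super().__init__()
        # Convolutional tokenizer: maps the 1-channel spectrogram to a grid
        # of embed_dim-dimensional tokens with shared weights and local bias.
        self.tokenizer = nn.Sequential(
            nn.Conv2d(1, embed_dim, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=3, stride=2, padding=1),
        )
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=heads, dim_feedforward=2 * embed_dim,
            dropout=0.1, batch_first=True,
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        # Sequence pooling: attention-weighted average over all tokens, so
        # every part of the signal contributes to the decision.
        self.attn_pool = nn.Linear(embed_dim, 1)
        self.classifier = nn.Linear(embed_dim, num_classes)

    def forward(self, spec):                          # spec: (batch, 1, n_mels, time)
        tokens = self.tokenizer(spec)                 # (batch, embed_dim, H, W)
        tokens = tokens.flatten(2).transpose(1, 2)    # (batch, H*W, embed_dim)
        tokens = self.encoder(tokens)
        weights = torch.softmax(self.attn_pool(tokens), dim=1)
        pooled = (weights * tokens).sum(dim=1)        # (batch, embed_dim)
        return self.classifier(pooled)                # logits: genuine vs. synthesized


# Example: a batch of two 128-mel spectrograms, 200 frames each (assumed shapes).
logits = CompactConvTransformer()(torch.randn(2, 1, 128, 200))
print(logits.shape)  # torch.Size([2, 2])
```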