Title: Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models

Apr 06, 2021

Zhiyun Lu, Wei Han, Yu Zhang, Liangliang Cao

Figures 1-4 for Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models

Abstract: Although end-to-end automatic speech recognition (e2e ASR) models are widely deployed, few studies have examined their robustness against adversarial perturbations. In this paper, we explore whether a targeted universal perturbation vector exists for e2e ASR models. Our goal is to find a perturbation that misleads a model into predicting a given target transcript, such as "thank you" or the empty string, on any input utterance. We study two attacks, additive and prepending perturbations, and their performance on state-of-the-art LAS, CTC, and RNN-T models. We find that LAS is the most vulnerable of the three models. RNN-T is more robust against additive perturbations, especially on long utterances, and CTC is robust against both additive and prepending perturbations. To attack RNN-T, we find that the prepending perturbation is more effective than the additive one, and that it can mislead the model into predicting the same short target on utterances of arbitrary length.
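To make the distinction between the two attacks concrete, here is a minimal sketch of how a single universal perturbation might be applied to a waveform in each case. It is an illustration only, not the authors' implementation: the function names `apply_additive` and `apply_prepending`, the `max_amp` clipping bound, and the choice to add the perturbation only over the first `len(delta)` samples of a longer utterance are all assumptions, since the abstract does not say how length mismatches are handled.

```python
import numpy as np


def apply_additive(audio, delta, max_amp=1.0):
    """Add the perturbation sample by sample to the start of the audio.

    Assumption: when the utterance is longer than delta, the remaining
    samples are left untouched; when it is shorter, delta is truncated.
    """
    n = min(len(audio), len(delta))
    out = np.asarray(audio, dtype=np.float32).copy()
    out[:n] += np.asarray(delta[:n], dtype=np.float32)
    # Keep the result in a valid waveform range (hypothetical bound).
    return np.clip(out, -max_amp, max_amp)


def apply_prepending(audio, delta, max_amp=1.0):
    """Place the perturbation in front of the audio; the speech is unchanged."""
    out = np.concatenate([delta, audio]).astype(np.float32)
    return np.clip(out, -max_amp, max_amp)


# Toy example: 5 samples of silence and a 3-sample perturbation.
audio = np.zeros(5, dtype=np.float32)
delta = np.full(3, 0.1, dtype=np.float32)

added = apply_additive(audio, delta)        # same length as the input
prepended = apply_prepending(audio, delta)  # longer by len(delta)
```

In this sketch the additive variant keeps the utterance length and overlaps the perturbation with the speech, while the prepending variant lengthens the input and leaves the original samples intact. Finding a `delta` that actually drives a model to a target transcript requires optimizing it against the model over many utterances, which is outside the scope of this example.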

* Submitted to INTERSPEECH 2021
