Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ching-Yu Kao

Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Feb 01, 2022

Karla Markert, Romain Parracone, Mykhailo Kulakov, Philip Sperl, Ching-Yu Kao, Konstantin Böttinger

Figure 1 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Figure 2 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Figure 3 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Figure 4 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Abstract:Automatic speech recognition (ASR) is improving ever more at mimicking human speech processing. The functioning of ASR, however, remains to a large extent obfuscated by the complex structure of the deep neural networks (DNNs) they are based on. In this paper, we show how so-called attribution methods, that we import from image recognition and suitably adapt to handle audio data, can help to clarify the working of ASR. Taking DeepSpeech, an end-to-end model for ASR, as a case study, we show how these techniques help to visualize which features of the input are the most influential in determining the output. We focus on three visualization techniques: Layer-wise Relevance Propagation (LRP), Saliency Maps, and Shapley Additive Explanations (SHAP). We compare these methods and discuss potential further applications, such as in the detection of adversarial examples.

* Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication

Via

Access Paper or Ask Questions

DLA: Dense-Layer-Analysis for Adversarial Example Detection

Nov 05, 2019

Philip Sperl, Ching-Yu Kao, Peng Chen, Konstantin Böttinger

Figure 1 for DLA: Dense-Layer-Analysis for Adversarial Example Detection

Figure 2 for DLA: Dense-Layer-Analysis for Adversarial Example Detection

Figure 3 for DLA: Dense-Layer-Analysis for Adversarial Example Detection

Figure 4 for DLA: Dense-Layer-Analysis for Adversarial Example Detection

Abstract:In recent years Deep Neural Networks (DNNs) have achieved remarkable results and even showed super-human capabilities in a broad range of domains. This led people to trust in DNNs' classifications and resulting actions even in security-sensitive environments like autonomous driving. Despite their impressive achievements, DNNs are known to be vulnerable to adversarial examples. Such inputs contain small perturbations to intentionally fool the attacked model. In this paper, we present a novel end-to-end framework to detect such attacks during classification without influencing the target model's performance. Inspired by recent research in neuron-coverage guided testing we show that dense layers of DNNs carry security-sensitive information. With a secondary DNN we analyze the activation patterns of the dense layers during classification runtime, which enables effective and real-time detection of adversarial examples. Our prototype implementation successfully detects adversarial examples in image, natural language, and audio processing. Thereby, we cover a variety of target DNNs, including Long Short Term Memory (LSTM) architectures. In addition, to effectively defend against state-of-the-art attacks, our approach generalizes between different sets of adversarial examples. Thus, our method most likely enables us to detect even future, yet unknown attacks. Finally, during white-box adaptive attacks, we show our method cannot be easily bypassed.

Via

Access Paper or Ask Questions