Chung-Yi Li

Language Representation in Multilingual BERT and its applications to improve Cross-lingual Generalization

Oct 23, 2020

Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Chung-Yi Li, Hung-yi Lee

Abstract: A token embedding in multilingual BERT (m-BERT) contains both language and semantic information. We find that the representation of a language can be obtained by simply averaging the embeddings of the tokens of that language. With the language representation, we can control the output language of multilingual BERT by manipulating the token embeddings and achieve unsupervised token translation. We further propose a computationally cheap but effective approach to improve the cross-lingual ability of m-BERT based on this observation.
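
The averaging idea in the abstract above can be illustrated with a small NumPy sketch. It is not the authors' implementation: the toy embeddings are random stand-ins for m-BERT token embeddings, and the function names, the `alpha` parameter, and the choice of shifting by the difference of two language means are all assumptions made here for illustration. The abstract itself only says that token embeddings are "manipulated".

```python
import numpy as np


def language_representation(token_embeddings: np.ndarray) -> np.ndarray:
    """Average token embeddings of shape [n_tokens, dim] into one vector."""
    return token_embeddings.mean(axis=0)


def shift_toward_language(
    token_embeddings: np.ndarray,
    source_repr: np.ndarray,
    target_repr: np.ndarray,
    alpha: float = 1.0,
) -> np.ndarray:
    """Move embeddings along the direction from source to target language mean.

    Assumption: a plain mean-difference shift; alpha scales its strength.
    """
    return token_embeddings + alpha * (target_repr - source_repr)


# Toy data standing in for embeddings of tokens from two languages.
rng = np.random.default_rng(0)
dim = 8
en_tokens = rng.normal(loc=0.5, size=(100, dim))
de_tokens = rng.normal(loc=-0.5, size=(120, dim))

en_repr = language_representation(en_tokens)
de_repr = language_representation(de_tokens)

# With alpha = 1, the shifted English tokens have the German mean.
shifted = shift_toward_language(en_tokens, en_repr, de_repr)
```

In this toy setting the shift changes only the mean of the token cloud, so whatever structure the embeddings have relative to their own language mean is left untouched.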

* preprint

What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis

Nov 04, 2019

Chung-Yi Li, Pei-Chieh Yuan, Hung-Yi Lee

Abstract: End-to-end speech recognition systems have achieved competitive results compared to traditional systems. However, the complex transformations between layers, applied to highly variable acoustic signals, are hard to analyze. In this paper, we present our ASR probing model, which synthesizes speech from the hidden representations of an end-to-end ASR model to examine the information retained after each layer's computation. Listening to the synthesized speech, we observe gradual removal of speaker variability and noise as the layers go deeper, which aligns with previous studies on how deep networks function in speech recognition. This paper is the first study to analyze an end-to-end speech recognition model by demonstrating what each layer hears. Speaker verification and speech enhancement measurements on the synthesized speech are also conducted to further confirm our observation.

* submitted to ICASSP 2020
