Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Context-Dependent Acoustic Modeling without Explicit Phone Clustering

May 15, 2020

Tina Raissi, Eugen Beck, Ralf Schlüter, Hermann Ney

Figure 1 for Context-Dependent Acoustic Modeling without Explicit Phone Clustering

Figure 2 for Context-Dependent Acoustic Modeling without Explicit Phone Clustering

Figure 3 for Context-Dependent Acoustic Modeling without Explicit Phone Clustering

Figure 4 for Context-Dependent Acoustic Modeling without Explicit Phone Clustering

Share this with someone who'll enjoy it:

Abstract:Phoneme-based acoustic modeling of large vocabulary automatic speech recognition takes advantage of phoneme context. The large number of context-dependent (CD) phonemes and their highly varying statistics require tying or smoothing to enable robust training. Usually, Classification and Regression Trees are used for phonetic clustering, which is standard in Hidden Markov Model (HMM)-based systems. However, this solution introduces a secondary training objective and does not allow for end-to-end training. In this work, we address a direct phonetic context modeling for the hybrid Deep Neural Network (DNN)/HMM, that does not build on any phone clustering algorithm for the determination of the HMM state inventory. By performing different decompositions of the joint probability of the center phoneme state and its left and right contexts, we obtain a factorized network consisting of different components, trained jointly. Moreover, the representation of the phonetic context for the network relies on phoneme embeddings. The recognition accuracy of our proposed models on the Switchboard task is comparable and outperforms slightly the hybrid model using the standard state-tying decision trees.

* Submitted to Interspeech 2020

View paper on

Share this with someone who'll enjoy it:

Title:Context-Dependent Acoustic Modeling without Explicit Phone Clustering

Paper and Code