Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Seppo Enarvi

Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Sep 29, 2017

Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo

Figure 1 for Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Figure 2 for Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Figure 3 for Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Figure 4 for Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Abstract:Today, the vocabulary size for language models in large vocabulary speech recognition is typically several hundreds of thousands of words. While this is already sufficient in some applications, the out-of-vocabulary words are still limiting the usability in others. In agglutinative languages the vocabulary for conversational speech should include millions of word forms to cover the spelling variations due to colloquial pronunciations, in addition to the word compounding and inflections. Very large vocabularies are also needed, for example, when the recognition of rare proper names is important.

* IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 11, pp. 2085-2097, November 2017

Via

Access Paper or Ask Questions

TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

Aug 08, 2016

Seppo Enarvi, Mikko Kurimo

Figure 1 for TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

Figure 2 for TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

Figure 3 for TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

Figure 4 for TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

Abstract:We present a new tool for training neural network language models (NNLMs), scoring sentences, and generating text. The tool has been written using Python library Theano, which allows researcher to easily extend it and tune any aspect of the training process. Regardless of the flexibility, Theano is able to generate extremely fast native code that can utilize a GPU or multiple CPU cores in order to parallelize the heavy numerical computations. The tool has been evaluated in difficult Finnish and English conversational speech recognition tasks, and significant improvement was obtained over our best back-off n-gram models. The results that we obtained in the Finnish task were compared to those from existing RNNLM and RWTHLM toolkits, and found to be as good or better, while training times were an order of magnitude shorter.

* Proc. Interspeech 2016, pp. 3052-3056

Via

Access Paper or Ask Questions