Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Spoken Language Identification using ConvNets

Oct 09, 2019

Sarthak, Shikhar Shukla, Govind Mittal

Figure 1 for Spoken Language Identification using ConvNets

Figure 2 for Spoken Language Identification using ConvNets

Figure 3 for Spoken Language Identification using ConvNets

Figure 4 for Spoken Language Identification using ConvNets

Share this with someone who'll enjoy it:

Abstract:Language Identification (LI) is an important first step in several speech processing systems. With a growing number of voice-based assistants, speech LI has emerged as a widely researched field. To approach the problem of identifying languages, we can either adopt an implicit approach where only the speech for a language is present or an explicit one where text is available with its corresponding transcript. This paper focuses on an implicit approach due to the absence of transcriptive data. This paper benchmarks existing models and proposes a new attention based model for language identification which uses log-Mel spectrogram images as input. We also present the effectiveness of raw waveforms as features to neural network models for LI tasks. For training and evaluation of models, we classified six languages (English, French, German, Spanish, Russian and Italian) with an accuracy of 95.4% and four languages (English, French, German, Spanish) with an accuracy of 96.3% obtained from the VoxForge dataset. This approach can further be scaled to incorporate more languages.

* 2019 European Conference on Ambient Intelligence

View paper on

Share this with someone who'll enjoy it:

Title:Spoken Language Identification using ConvNets

Paper and Code