Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Feb 15, 2021

Bidisha Sharma, Maulik Madhavi, Haizhou Li

Figure 1 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Figure 2 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Figure 3 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Figure 4 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Share this with someone who'll enjoy it:

Abstract:Intent classification is a task in spoken language understanding. An intent classification system is usually implemented as a pipeline process, with a speech recognition module followed by text processing that classifies the intents. There are also studies of end-to-end system that takes acoustic features as input and classifies the intents directly. Such systems don't take advantage of relevant linguistic information, and suffer from limited training data. In this work, we propose a novel intent classification framework that employs acoustic features extracted from a pretrained speech recognition system and linguistic features learned from a pretrained language model. We use knowledge distillation technique to map the acoustic embeddings towards linguistic embeddings. We perform fusion of both acoustic and linguistic embeddings through cross-attention approach to classify intents. With the proposed method, we achieve 90.86% and 99.07% accuracy on ATIS and Fluent speech corpus, respectively.

View paper on

Share this with someone who'll enjoy it:

Title:Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Paper and Code