Title: SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning

Jun 27, 2022

Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao

Abstract: Speech emotion recognition (SER) faces many challenges, and one of the main ones is that frameworks lack a unified standard. In this paper, we propose SpeechEQ, a framework for unifying SER tasks based on a multi-scale unified metric. This metric can be trained by Multitask Learning (MTL), which includes two emotion recognition tasks, Emotion States Category (ESC) and Emotion Intensity Scale (EIS), and two auxiliary tasks, phoneme recognition and gender recognition. For this framework, we build a Mandarin SER dataset, the SpeechEQ Dataset (SEQD). We conducted experiments on the public Mandarin datasets CASIA and ESD, which show that our method outperforms baseline methods by a relatively large margin, yielding improvements in accuracy of 8.0% and 6.5%, respectively. Additional experiments on IEMOCAP with four emotion categories (i.e., angry, happy, sad, and neutral) also show that the proposed method achieves state-of-the-art results, with a weighted accuracy (WA) of 78.16% and an unweighted accuracy (UA) of 77.47%.
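The abstract reports both weighted accuracy (WA) and unweighted accuracy (UA) without defining them. As a minimal sketch, assuming the definitions commonly used in the SER literature (WA as overall accuracy across all utterances, UA as the mean of per-class recalls), the two metrics could be computed as below. The function names and the toy labels are illustrative only and are not taken from the paper.

```python
from collections import defaultdict


def weighted_accuracy(y_true, y_pred):
    """Overall accuracy: each utterance counts equally (assumed WA definition)."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls: each class counts equally (assumed UA definition)."""
    total = defaultdict(int)  # utterances per true class
    hit = defaultdict(int)    # correctly classified utterances per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        hit[t] += int(t == p)
    return sum(hit[c] / total[c] for c in total) / len(total)


# Toy, made-up labels over the four IEMOCAP categories named in the abstract.
y_true = ["angry", "angry", "angry", "angry", "happy", "happy", "sad", "neutral"]
y_pred = ["angry", "angry", "angry", "happy", "happy", "sad", "sad", "neutral"]

wa = weighted_accuracy(y_true, y_pred)
ua = unweighted_accuracy(y_true, y_pred)
print(f"WA = {wa:.4f}, UA = {ua:.4f}")
```

Under these assumed definitions the two numbers diverge when classes are imbalanced, which is why SER results on IEMOCAP are usually reported with both.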
