Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Oct 07, 2021

Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann

Figure 1 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Figure 2 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Figure 3 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Figure 4 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Share this with someone who'll enjoy it:

Abstract:Emotions are subjective constructs. Recent end-to-end speech emotion recognition systems are typically agnostic to the subjective nature of emotions, despite their state-of-the-art performances. In this work, we introduce an end-to-end Bayesian neural network architecture to capture the inherent subjectivity in emotions. To the best of our knowledge, this work is the first to use Bayesian neural networks for speech emotion recognition. At training, the network learns a distribution of weights to capture the inherent uncertainty related to subjective emotion annotations. For this, we introduce a loss term which enables the model to be explicitly trained on a distribution of emotion annotations, rather than training them exclusively on mean or gold-standard labels. We evaluate the proposed approach on the AVEC'16 emotion recognition dataset. Qualitative and quantitative analysis of the results reveal that the proposed model can aptly capture the distribution of subjective emotion annotations with a compromise between mean and standard deviation estimations.

* (c) 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

View paper on

Share this with someone who'll enjoy it:

Title:End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Paper and Code