Title: Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition

Mar 16, 2021

Liam Schoneveld, Alice Othmani, Hazem Abdelkawy

Abstract: Emotional expressions are the behaviors that communicate our emotional state or attitude to others. They are expressed through verbal and non-verbal communication. Complex human behavior can be understood by studying physical features from multiple modalities, mainly facial expressions, vocal cues and physical gestures. Recently, spontaneous multi-modal emotion recognition has been extensively studied for human behavior analysis. In this paper, we propose a new deep learning-based approach for audio-visual emotion recognition. Our approach leverages recent advances in deep learning, such as knowledge distillation and high-performing deep architectures. The deep feature representations of the audio and visual modalities are fused using a model-level fusion strategy. A recurrent neural network is then used to capture the temporal dynamics. Our proposed approach substantially outperforms state-of-the-art approaches in predicting valence on the RECOLA dataset. Moreover, our proposed visual facial expression feature extraction network outperforms state-of-the-art results on the AffectNet and Google Facial Expression Comparison datasets.

* 8 pages, 3 figures, Pattern Recognition Letters
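
The pipeline outlined in the abstract (per-modality deep features, model-level fusion, then a recurrent network over time) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that fusion is a simple concatenation of per-frame feature vectors, that the recurrent model is a plain tanh RNN, and that valence is squashed to (-1, 1); the abstract specifies none of these. All names (`fuse`, `predict_valence`, `hidden_size`) are hypothetical, and the random weights stand in for parameters that would normally be learned.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (placeholder for learned parameters)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def fuse(audio_feat, visual_feat):
    """Assumed fusion operator: concatenate the two per-frame feature vectors."""
    return audio_feat + visual_feat


def predict_valence(audio_seq, visual_seq, hidden_size=8, seed=0):
    """Run a plain tanh RNN over fused features; return one score per frame."""
    rng = random.Random(seed)
    in_size = len(audio_seq[0]) + len(visual_seq[0])
    w_in = make_matrix(hidden_size, in_size, rng)
    w_rec = make_matrix(hidden_size, hidden_size, rng)
    w_out = make_matrix(1, hidden_size, rng)

    hidden = [0.0] * hidden_size
    outputs = []
    for audio_feat, visual_feat in zip(audio_seq, visual_seq):
        fused = fuse(audio_feat, visual_feat)
        # The recurrent update carries temporal context from earlier frames.
        pre = [i + r for i, r in zip(matvec(w_in, fused), matvec(w_rec, hidden))]
        hidden = [math.tanh(p) for p in pre]
        # Assumed output range: tanh keeps each score strictly inside (-1, 1).
        outputs.append(math.tanh(matvec(w_out, hidden)[0]))
    return outputs


if __name__ == "__main__":
    audio = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]    # 3 frames, 2-dim audio features
    visual = [[1.0, 0.0, -1.0] for _ in range(3)]   # 3 frames, 3-dim visual features
    print(predict_valence(audio, visual))
```

In a real system the two feature sequences would come from trained audio and visual networks, and the fusion and recurrent weights would be trained jointly on annotated data rather than drawn at random.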
