Yixuan Ji

Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network

Nov 02, 2024

Shaokai Li, Yixuan Ji, Peng Song, Haoqin Sun, Wenming Zheng

Abstract: In this paper, we propose a novel deep inductive transfer learning framework, named feature distribution adaptation network, to tackle the challenging multi-modal speech emotion recognition problem. Our method uses deep transfer learning strategies to align visual and audio feature distributions and obtain a consistent representation of emotion, thereby improving the performance of speech emotion recognition. In our model, a pre-trained ResNet-34 is used to extract features from facial expression images and acoustic Mel spectrograms, respectively. Then, a cross-attention mechanism is introduced to model the intrinsic similarity relationships of the multi-modal features. Finally, multi-modal feature distribution adaptation is performed efficiently with a feed-forward network, which is extended using the local maximum mean discrepancy loss. Experiments are carried out on two benchmark datasets, and the results demonstrate that our model achieves excellent performance compared with existing ones.
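To make the idea of aligning two feature distributions concrete, here is a minimal sketch of a plain (global) maximum mean discrepancy estimate with a Gaussian kernel. It is not the authors' implementation: the abstract names the local variant of the loss, which as usually described also weights samples by class, and that weighting is omitted here. The function names, the bandwidth value, and the toy feature vectors are all assumptions made for illustration.

```python
import math


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs drawn from xs and ys."""
    total = sum(rbf_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd_squared(source, target, bandwidth=1.0):
    """Biased estimate of the squared MMD between two sets of vectors."""
    return (
        mean_kernel(source, source, bandwidth)
        + mean_kernel(target, target, bandwidth)
        - 2.0 * mean_kernel(source, target, bandwidth)
    )


# Toy stand-ins for audio and visual features (hypothetical values).
audio_feats = [[0.1, 0.2], [0.0, 0.3], [0.2, 0.1]]
visual_feats = [[1.1, 1.2], [1.0, 1.3], [1.2, 1.1]]

# Distance between the two modalities, and of one modality to itself.
cross_modal_gap = mmd_squared(audio_feats, visual_feats)
self_gap = mmd_squared(audio_feats, audio_feats)
```

In a training loop, a quantity like `cross_modal_gap` would be added to the classification loss so that minimizing it pulls the two feature distributions together; `self_gap` is included only to show that identical sets give a value of zero.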


Feature distribution Adaptation Network for Speech Emotion Recognition

Oct 29, 2024

Shaokai Li, Yixuan Ji, Peng Song, Haoqin Sun, Wenming Zheng
