Picture for Chengxin Chen

Chengxin Chen

TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition

Add code
Apr 19, 2024
Figure 1 for TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Figure 2 for TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Figure 3 for TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Figure 4 for TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Viaarxiv icon

Modality-Collaborative Transformer with Hybrid Feature Reconstruction for Robust Emotion Recognition

Add code
Dec 26, 2023
Viaarxiv icon

DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition

Add code
Dec 25, 2023
Viaarxiv icon

Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy

Add code
Apr 25, 2022
Figure 1 for Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy
Figure 2 for Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy
Figure 3 for Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy
Figure 4 for Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy
Viaarxiv icon

CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition

Add code
Mar 31, 2022
Figure 1 for CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
Figure 2 for CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
Figure 3 for CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
Figure 4 for CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
Viaarxiv icon