Picture for Haoqin Sun

Haoqin Sun

CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition

Add code
Feb 26, 2025
Viaarxiv icon

MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation

Add code
Jan 18, 2025
Figure 1 for MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation
Figure 2 for MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation
Figure 3 for MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation
Figure 4 for MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation
Viaarxiv icon

Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment

Add code
Dec 30, 2024
Viaarxiv icon

Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network

Add code
Nov 02, 2024
Figure 1 for Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network
Figure 2 for Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network
Figure 3 for Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network
Figure 4 for Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network
Viaarxiv icon

Feature distribution Adaptation Network for Speech Emotion Recognition

Add code
Oct 29, 2024
Figure 1 for Feature distribution Adaptation Network for Speech Emotion Recognition
Figure 2 for Feature distribution Adaptation Network for Speech Emotion Recognition
Figure 3 for Feature distribution Adaptation Network for Speech Emotion Recognition
Figure 4 for Feature distribution Adaptation Network for Speech Emotion Recognition
Viaarxiv icon

ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5

Add code
Sep 27, 2024
Figure 1 for ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
Figure 2 for ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
Figure 3 for ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
Figure 4 for ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
Viaarxiv icon

M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper

Add code
Sep 18, 2024
Viaarxiv icon

Uncertainty-Aware Mean Opinion Score Prediction

Add code
Aug 23, 2024
Viaarxiv icon

Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition

Add code
Aug 01, 2024
Figure 1 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 2 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 3 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 4 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Viaarxiv icon

Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

Add code
Jul 12, 2024
Figure 1 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 2 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 3 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 4 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Viaarxiv icon