Zhengdong Yang

When Large Language Models Meet Speech: A Survey on Integration Approaches

Feb 26, 2025

Cross-lingual Embedding Clustering for Hierarchical Softmax in Low-Resource Multilingual Speech Recognition

Jan 29, 2025

MELD-ST: An Emotion-aware Speech Translation Dataset

May 21, 2024

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

Jan 25, 2024

FedCPC: An Effective Federated Contrastive Learning Method for Privacy Preserving Early-Stage Alzheimer's Speech Detection

Nov 21, 2023

Fusion of Self-supervised Learned Models for MOS Prediction

Apr 11, 2022