Picture for Hongfei Xue

Hongfei Xue

ControlMM: Controllable Masked Motion Generation

Add code
Oct 14, 2024
Viaarxiv icon

Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text

Add code
Sep 17, 2024
Viaarxiv icon

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

Add code
Sep 09, 2024
Figure 1 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 2 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 3 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 4 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Viaarxiv icon

AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection

Add code
Jun 11, 2024
Figure 1 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Figure 2 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Figure 3 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Viaarxiv icon

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

Add code
May 06, 2024
Figure 1 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 2 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 3 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 4 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Viaarxiv icon

E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models

Add code
Jan 06, 2024
Viaarxiv icon

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

Add code
Oct 08, 2023
Viaarxiv icon

SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition

Add code
Sep 29, 2023
Viaarxiv icon

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition

Add code
May 23, 2023
Viaarxiv icon

Fusing Global and Local Features for Generalized AI-Synthesized Image Detection

Add code
Mar 26, 2022
Figure 1 for Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
Figure 2 for Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
Figure 3 for Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
Figure 4 for Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
Viaarxiv icon