Qingyang Hong

Dynamic Language Group-Based MoE: Enhancing Efficiency and Flexibility for Code-Switching Speech Recognition

Jul 26, 2024
Via arXiv

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation

Jun 12, 2024
Via arXiv

MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis

Dec 28, 2023
Via arXiv

ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech

Sep 29, 2023
Via arXiv

Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization

Jun 26, 2023
Via arXiv

Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge

Jun 07, 2023
Via arXiv

Towards A Unified Conformer Structure: from ASR to ASV Task

Nov 14, 2022
Via arXiv

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting

Sep 24, 2022
Via arXiv

Deep Representation Decomposition for Rate-Invariant Speaker Verification

May 28, 2022
Via arXiv

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data

Apr 25, 2022
Via arXiv