Picture for Trung Hieu Nguyen

Trung Hieu Nguyen

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions

Add code
Sep 25, 2024
Viaarxiv icon

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

Add code
Dec 19, 2023
Viaarxiv icon

SPGM: Prioritizing Local Features for enhanced speech separation performance

Add code
Sep 22, 2023
Viaarxiv icon

Are Soft Prompts Good Zero-shot Learners for Speech Recognition?

Add code
Sep 18, 2023
Viaarxiv icon

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

Add code
May 20, 2023
Viaarxiv icon

Contrastive Speech Mixup for Low-resource Keyword Spotting

Add code
May 02, 2023
Viaarxiv icon

Adaptive Knowledge Distillation between Text and Speech Pre-trained Models

Add code
Mar 07, 2023
Viaarxiv icon

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses

Add code
Feb 03, 2021
Figure 1 for Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Figure 2 for Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Figure 3 for Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Figure 4 for Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Viaarxiv icon

Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram

Add code
Feb 03, 2021
Figure 1 for Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Figure 2 for Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Figure 3 for Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Figure 4 for Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Viaarxiv icon

Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion

Add code
Oct 16, 2020
Figure 1 for Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Figure 2 for Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Figure 3 for Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Figure 4 for Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Viaarxiv icon