Zirun Zhu

E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

Jun 26, 2024

An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

Jun 09, 2024

Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

Jun 06, 2024

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024

Real-Time Audio-Visual End-to-End Speech Enhancement

Mar 13, 2023

VarArray: Array-Geometry-Agnostic Continuous Speech Separation

Oct 26, 2021

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement

Jun 05, 2021