Picture for Yahuan Cong

Yahuan Cong

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

Add code
Jun 27, 2023
Figure 1 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 2 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 3 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Figure 4 for GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Viaarxiv icon

Self-supervised learning for audio-visual speaker diarization

Add code
Feb 13, 2020
Figure 1 for Self-supervised learning for audio-visual speaker diarization
Figure 2 for Self-supervised learning for audio-visual speaker diarization
Figure 3 for Self-supervised learning for audio-visual speaker diarization
Figure 4 for Self-supervised learning for audio-visual speaker diarization
Viaarxiv icon