Picture for Xiaobin Zhuang

Xiaobin Zhuang

Improving Audio Generation with Visual Enhanced Caption

Add code
Jul 05, 2024
Viaarxiv icon

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Add code
Jun 04, 2024
Figure 1 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 2 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 3 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 4 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Viaarxiv icon

KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke

Add code
Oct 18, 2021
Figure 1 for KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke
Figure 2 for KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke
Figure 3 for KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke
Figure 4 for KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke
Viaarxiv icon