Picture for Jianyi Chen

Jianyi Chen

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Add code
Feb 06, 2025
Viaarxiv icon

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Add code
Aug 30, 2024
Figure 1 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 2 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 3 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 4 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Viaarxiv icon

FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation

Add code
May 13, 2024
Viaarxiv icon

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Apr 25, 2024
Figure 1 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 2 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 3 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 4 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Viaarxiv icon