Picture for Yunlin Chen

Yunlin Chen

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Add code
Mar 03, 2025
Viaarxiv icon

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Add code
Feb 06, 2025
Viaarxiv icon

Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation

Add code
Jun 11, 2024
Viaarxiv icon

SponTTS: modeling and transferring spontaneous style for TTS

Add code
Nov 13, 2023
Viaarxiv icon

PromptSpeaker: Speaker Generation Based on Text Descriptions

Add code
Oct 08, 2023
Figure 1 for PromptSpeaker: Speaker Generation Based on Text Descriptions
Figure 2 for PromptSpeaker: Speaker Generation Based on Text Descriptions
Figure 3 for PromptSpeaker: Speaker Generation Based on Text Descriptions
Figure 4 for PromptSpeaker: Speaker Generation Based on Text Descriptions
Viaarxiv icon

PromptStyle: Controllable Style Transfer for Text-to-Speech with Natural Language Descriptions

Add code
Jun 01, 2023
Viaarxiv icon