Zhiping Xiu

Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation

Oct 27, 2024
Via arXiv

Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens

Oct 04, 2024
Via arXiv

Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis

Jan 19, 2024
Via arXiv

Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling

Apr 01, 2021
Via arXiv