Picture for Wonjune Kang

Wonjune Kang

Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

Add code
Oct 02, 2024
Viaarxiv icon

Prompting Large Language Models with Audio for General-Purpose Speech Summarization

Add code
Jun 10, 2024
Viaarxiv icon

Multi-Task Learning for Front-End Text Processing in TTS

Add code
Jan 12, 2024
Figure 1 for Multi-Task Learning for Front-End Text Processing in TTS
Figure 2 for Multi-Task Learning for Front-End Text Processing in TTS
Figure 3 for Multi-Task Learning for Front-End Text Processing in TTS
Viaarxiv icon

ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

Add code
May 23, 2023
Viaarxiv icon

End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions

Add code
May 19, 2022
Figure 1 for End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions
Figure 2 for End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions
Figure 3 for End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions
Figure 4 for End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions
Viaarxiv icon