Wonjune Kang

Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

Oct 02, 2024
Via arXiv

Prompting Large Language Models with Audio for General-Purpose Speech Summarization

Jun 10, 2024
Via arXiv

Multi-Task Learning for Front-End Text Processing in TTS

Jan 12, 2024
Via arXiv

ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

May 23, 2023
Via arXiv

End-to-End Zero-Shot Voice Style Transfer with Location-Variable Convolutions

May 19, 2022
Via arXiv