Heeseung Kim

Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models

Feb 27, 2025
Via arXiv

EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models

Feb 27, 2025
Via arXiv

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Nov 23, 2024
Via arXiv

Style-Friendly SNR Sampler for Style-Driven Generation

Nov 22, 2024
Via arXiv

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers

Sep 24, 2024
Via arXiv

VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance

Sep 24, 2024
Via arXiv

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech

Aug 27, 2024
Via arXiv

HyperCLOVA X Technical Report

Apr 13, 2024
Via arXiv

Unified Speech-Text Pretraining for Spoken Dialog Modeling

Feb 08, 2024
Via arXiv

UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data

Jun 28, 2023
Via arXiv