Picture for Minchan Kim

Minchan Kim

SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech

Add code
Oct 07, 2024
Viaarxiv icon

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Add code
Oct 02, 2024
Figure 1 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 2 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 3 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 4 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Viaarxiv icon

High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

Add code
Jun 25, 2024
Viaarxiv icon

MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance

Add code
Jun 10, 2024
Viaarxiv icon

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Add code
Mar 26, 2024
Viaarxiv icon

Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction

Add code
Jan 03, 2024
Viaarxiv icon

Efficient Parallel Audio Generation using Group Masked Language Modeling

Add code
Jan 02, 2024
Viaarxiv icon

Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction

Add code
Nov 08, 2023
Viaarxiv icon

Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer

Add code
Sep 06, 2023
Figure 1 for Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer
Figure 2 for Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer
Figure 3 for Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer
Figure 4 for Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer
Viaarxiv icon

EM-Network: Oracle Guided Self-distillation for Sequence Learning

Add code
Jun 14, 2023
Figure 1 for EM-Network: Oracle Guided Self-distillation for Sequence Learning
Figure 2 for EM-Network: Oracle Guided Self-distillation for Sequence Learning
Figure 3 for EM-Network: Oracle Guided Self-distillation for Sequence Learning
Figure 4 for EM-Network: Oracle Guided Self-distillation for Sequence Learning
Viaarxiv icon