Chanwoo Kim

Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation

Jan 14, 2025
Via arXiv

Physics Informed Distillation for Diffusion Models

Nov 13, 2024
Via arXiv

Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution

Sep 17, 2024
Via arXiv

AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition

Mar 18, 2024
Via arXiv

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution

Jan 29, 2024
Via arXiv

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech

Jan 19, 2024
Via arXiv

On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition

Dec 15, 2023
Via arXiv

Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy

Dec 14, 2023
Via arXiv

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

Oct 05, 2023
Via arXiv

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Aug 16, 2023
Via arXiv